Bilgisayar ve Bilişim BilimleriİngilizceGhost

Research Graph

Research Graph
Research Graph
Ana SayfaRSS Besleme
language
Retrieval-augmented-genArtificial-intelligenceLarge-language-modelsKnowledge-graphBilgisayar ve Bilişim Bilimleriİngilizce
Yayınlandı
Yazar Wenyi Pi

Enhancing Data Interactivity with LLMs and Neo4j Knowledge Graphs Author Wenyi Pi ( ORCID : 0009–0002–2884–2771) Introduction Since OpenAI launched ChatGPT, a large language model (LLM) based chatbot, in 2023, it has set off a technological wave.

Large-language-modelsArtificial-intelligencePrompt-engineeringBilgisayar ve Bilişim Bilimleriİngilizce
Yayınlandı

Author Dhruv Gupta ( ORCID : 0009–0004–7109–5403) Introduction Large Language Models (LLMs) have become the new face of Natural language processing (NLP). With their generative power and ability to comprehend human language, the human reliance on these models is increasing every day. However, the LLMs have been known to hallucinate and thus produce wrong outputs.

MegalodonLong-textsTransformer-architectureBilgisayar ve Bilişim Bilimleriİngilizce
Yayınlandı

An improvement architecture superior to the Transformer, proposed by Meta Author · Qingqin Fang ( ORCID: 0009–0003–5348–4264) Introduction Recently, researchers from Meta and the University of Southern California have introduced a model called Megalodon. They claim that this model can expand the context window of language models to handle millions of tokens without overwhelming your memory.

Large-language-modelsArtificial-intelligenceTransformersNatural-language-processBilgisayar ve Bilişim Bilimleriİngilizce
Yayınlandı
Yazar Wenyi Pi

Understanding the Evolutionary Journey of LLMs Author Wenyi Pi ( ORCID : 0009–0002–2884–2771) Introduction When we talk about large language models (LLMs), we are actually referring to a type of advanced software that can communicate in a human-like manner. These models have the amazing ability to understand complex contexts and generate content that is coherent and has a human feel.

Natural-language-processiTransformersArtificial-intelligenceBilgisayar ve Bilişim Bilimleriİngilizce
Yayınlandı

Attention mechanism not getting enough attention Author Dhruv Gupta ( ORCID : 0009–0004–7109–5403) Introduction As discussed in this article, RNNs were incapable of learning long-term dependencies. To solve this issue both LSTMs and GRUs were introduced. However, even though LSTMs and GRUs did a fairly decent job for textual data they did not perform well.