Rogue Scholar

Artificial IntelligenceTocInformática y Ciencias de la InformaciónInglés

ChatHuman: Revolutionizing 3D Human Understanding with RAG

https://doi.org/10.59350/2fy1q-a0s37

Publicado 17 de mayo de 2024

Autor Vaibhav Khobragade

Meet the General Specialist: Where AI Generalists Harness Specialist Tools for Unmatched Precision

Artificial IntelligenceTocInformática y Ciencias de la InformaciónInglés

CompactifAI: Large Language Models Don’t Actually Have To Be Large

https://doi.org/10.59350/mn7pe-zhx56

Publicado 16 de mayo de 2024

Autor Amanda Kau

A novel compression technique ensuring comparable performance with 70% less parameters

Retrieval-augmented-genArtificial-intelligenceLarge-language-modelsKnowledge-graphInformática y Ciencias de la InformaciónInglés

The integration of large language models (LLMs) with Neo4j-based knowledge graphs

https://doi.org/10.59350/ceh3e-3qj55

Publicado 16 de mayo de 2024

Autor Wenyi Pi

Enhancing Data Interactivity with LLMs and Neo4j Knowledge Graphs Author Wenyi Pi ( ORCID : 0009–0002–2884–2771) Introduction Since OpenAI launched ChatGPT, a large language model (LLM) based chatbot, in 2023, it has set off a technological wave.

Large-language-modelsArtificial-intelligencePrompt-engineeringInformática y Ciencias de la InformaciónInglés

Prompt engineering: A Way to Smartly Use AI

https://doi.org/10.59350/etrxr-9v423

Publicado 14 de mayo de 2024

Autor Dhruv Gupta

Author Dhruv Gupta ( ORCID : 0009–0004–7109–5403) Introduction Large Language Models (LLMs) have become the new face of Natural language processing (NLP). With their generative power and ability to comprehend human language, the human reliance on these models is increasing every day. However, the LLMs have been known to hallucinate and thus produce wrong outputs.

Artificial IntelligenceTocInformática y Ciencias de la InformaciónInglés

How to use Large Language Models to tag your data: A complete tutorial

https://doi.org/10.59350/z1z3k-rrm02

Publicado 12 de mayo de 2024

Autor Xuzeng He

Using Mistral for Data tagging

Knowledge GraphTocInformática y Ciencias de la InformaciónInglés

Automated Knowledge Graph Construction with Large Language Models — Part 2

https://doi.org/10.59350/4c2mx-vm853

Publicado 12 de mayo de 2024

Autor Amanda Kau

Harvesting the Power and Knowledge of Large Language Models

MegalodonLong-textsTransformer-architectureInformática y Ciencias de la InformaciónInglés

The longer the context, the better? Unlimited Context Length in Megalodon

https://doi.org/10.59350/dx6a6-yy475

Publicado 7 de mayo de 2024

Autor Qingqin Fang

An improvement architecture superior to the Transformer, proposed by Meta Author · Qingqin Fang ( ORCID: 0009–0003–5348–4264) Introduction Recently, researchers from Meta and the University of Southern California have introduced a model called Megalodon. They claim that this model can expand the context window of language models to handle millions of tokens without overwhelming your memory.

Large-language-modelsArtificial-intelligenceTransformersNatural-language-processInformática y Ciencias de la InformaciónInglés

Brief Introduction to the History of Large Language Models (LLMs)

https://doi.org/10.59350/m4c7t-epg97

Publicado 7 de mayo de 2024

Autor Wenyi Pi

Understanding the Evolutionary Journey of LLMs Author Wenyi Pi ( ORCID : 0009–0002–2884–2771) Introduction When we talk about large language models (LLMs), we are actually referring to a type of advanced software that can communicate in a human-like manner. These models have the amazing ability to understand complex contexts and generate content that is coherent and has a human feel.

Natural-language-processiTransformersArtificial-intelligenceInformática y Ciencias de la InformaciónInglés

Transformers Models in NLP

https://doi.org/10.59350/c7nrg-xay43

Publicado 7 de mayo de 2024

Autor Dhruv Gupta

Attention mechanism not getting enough attention Author Dhruv Gupta ( ORCID : 0009–0004–7109–5403) Introduction As discussed in this article, RNNs were incapable of learning long-term dependencies. To solve this issue both LSTMs and GRUs were introduced. However, even though LSTMs and GRUs did a fairly decent job for textual data they did not perform well.

Artificial IntelligenceTocInformática y Ciencias de la InformaciónInglés