Informatique et sciences de l'informationAnglaisGhost

Research Graph

Research Graph
Research Graph
Page d'accueilFlux RSS
language
Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié

Introduction In the rapidly evolving landscape of artificial intelligence, Large Language Models (LLMs) have emerged as powerful tools capable of understanding and generating human-like text. While cloud-based services offer convenient access to these models, there’s a growing demand for local, hands-on experimentation.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié
Auteur Tarun Krishnan

Introduction The production of multimodal models has surged in recent years, with companies racing to develop models that can seamlessly handle both text and image instructions with high accuracy. On September 17, 2024 , Mistral introduced Pixtral , a lightweight yet powerful multimodal model that stands out from the crowd.

Artificial IntelligenceTocAnglais
Publié
Auteur Aditya Iyengar

Introduction In recent years, artificial intelligence (AI) has made astounding strides in transforming industries from healthcare to creative arts. Now, the gaming industry is poised for a revolution thanks to GameNGen — a pioneering project from Google Research and Tel Aviv University.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié
Auteur Tohfa Siddika Barbhuiya

Introduction Molmo AI on PixMo Dataset is a state-of-the-art, open-source multimodal model developed by the Allen Institute for AI (Ai2). Designed to rival proprietary models like GPT-4 and Claude, Molmo offers powerful image and text comprehension capabilities at a fraction of the cost and complexity of its competitors.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié

Introduction In the era of large models, the Transformer architecture, introduced in Google’s groundbreaking 2017 paper “Attention Is All You Need,” has become the mainstream. However, Liquid AI, a startup founded by former researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL), is taking a different path.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié

Introduction In the rapidly evolving landscape of artificial intelligence, developers and businesses face a common challenge: bridging the gap between cutting-edge AI models and practical, real-world applications. As the complexity and computational demands of AI continue to grow, so does the need for efficient, scalable, and user-friendly platforms that can harness the power of these models.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié
Auteur Wendi Fan

Introduction As the demand for more intelligent natural language processing (NLP) systems grows, researchers have developed benchmarks to assess the capabilities of these models across a wide range of linguistic tasks. The General Language Understanding Evaluation (GLUE) benchmark is one of the most influential benchmarks created to address this need.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié

Introduction At the recently concluded Meta Developer Conference, Llama 3.2 made a dazzling debut. This time, not only does it boast multimodal capabilities , but it has also partnered with companies like Arm to launch a “mobile” version optimised specifically for Qualcomm and MediaTek hardware.

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié
Auteur Tohfa Siddika Barbhuiya

Introduction to Llama 3.2 Llama 3.2 is Meta’s first open-source AI model capable of processing both text and images . This collection ranges from lightweight versions for edge devices to powerful multimodal models capable of sophisticated reasoning tasks.Key Features and Improvements Multimodal Capabilities For the first time in the Llama series, the 11B and 90B models support

Artificial IntelligenceTocInformatique et sciences de l'informationAnglais
Publié
Auteur Wendi Fan

Introduction In the rapidly evolving landscape of AI language models, Qwen has firmly established itself as a leader, consistently pushing the boundaries of innovation. Building on the success of Qwen2, which garnered widespread adoption from developers worldwide, the release of Qwen2.5 marks a groundbreaking milestone — one of the largest open-source contributions to date.

Artificial IntelligenceTocAnglais
Publié
Auteur Aditya Iyengar

Introduction Harnessing the capabilities of Stable Diffusion for generating customised images requires a nuanced understanding of prompt crafting. This guide delves deeply into the art and science of prompting techniques, offering structured approaches and advanced strategies for achieving precision and creativity in AI-generated imagery.