AGEofLLMs.com
Search

AI News and Trends: Large Language Models (LLMs)

Different models of artificial intelligence by various developers.

Latest in this category
Llama 4 Is Here and It's Not Messing Around

Meta's Llama 4 just dropped with massive models like Scout Maverick and Behemoth. Long context fast speeds and wild benchmarks. Here’s what you need to know.

read more
Mercury: Diffusion Large Language Models Are Here

Mercury by Inception Labs is the first diffusion-based LLM, generating text 10x faster and cheaper than traditional models. Instead of building sentences word by word it creates full responses at once then refines them step by step.

8 Mar 2025

OpenAI Drops SWE-Lancer: A New Benchmark for AI Coders

OpenAI just launched SWE-Lancer, a new benchmark testing AI coding skills with real freelance tasks worth $1M. Can AI handle real-world software jobs?

19 Feb 2025

Agent Leaderboard: How AI Agents Stack Up in Real-World Tasks

Galileo AI just dropped the Agent Leaderboard—a way to rate 17 top AI models across 14 varied datasets. It checks how well they handle tool-calling tasks, which matter a lot when building AI agents.

17 Feb 2025

openai
Deep Research. ChatGPT Just Got a Huge Upgrade

OpenAI has launched "Deep Research" for ChatGPT, an AI tool that digs through the web, analyzes data, and delivers detailed reports in minutes. Early users say it’s a game changer for research.

4 Feb 2025

openai
OpenAI o3-mini: Smarter AI for STEM at Lower Cost

OpenAI o3-mini is the latest reasoning model built for math, science, and coding. Faster, more accurate, and now available in ChatGPT and the API.

1 Feb 2025

Meet Kimi K1.5 – A Multi-Modal AI (China is Moving Fast!)

Kimi K1.5 by MoonshotAI is a cutting-edge multimodal AI handling text and images. With 128k token context and open-source access, it’s set to change AI innovation.

27 Jan 2025

deepseek
DeepSeek-R1: Open-Source Reasoning AI

Meet DeepSeek-R1, the open-source AI rivaling OpenAI's o1. It uses reinforcement learning to master reasoning and excels in benchmarks like math and coding tasks.

21 Jan 2025

bytedance
China’s AI Struggles: Doubao, Military Ambitions, and the Gap with U.S. Innovation

China’s AI faces challenges. From Doubao’s chatbot flaws to military tools like ChatBIT. How the U.S. leads in AI innovation with ChatGPT setting the standard.

14 Jan 2025

microsoft
Microsoft Releases Phi-4: A 14B Parameter Open-Source AI Model

Microsoft’s new 14-billion parameter AI model, Phi-4, is now open-source with full weights on Hugging Face under an MIT License. This efficient model excels in reasoning, coding, and math while using fewer resources.

9 Jan 2025

openai agi
OpenAI Drops o3

OpenAI’s o3 model is out and it’s breaking barriers in coding, science, and math and showing us what AI can really do.

21 Dec 2024

gemini google
Gemini 2.0 Flash: Google’s Next Big Leap

Create and edit images with Gemini 2.0 Flash—Google’s latest AI tool blending text and visuals. Explore real-time editing, multimodal outputs, and innovative features for your projects.

16 Dec 2024

openai
Canvas Gets Upgraded: Code Execution, Collaborative Editing, and More

ChatGPT Canvas updates: run Python code, edit collaboratively with custom GPTs, and access the tool on both web and desktop apps.

12 Dec 2024

alibaba
Alibaba’s Open-Source QwQ-32B Preview AI Model Challenges ChatGPT

Discover Alibaba’s QwQ-32B Preview AI, an open-source model delivering groundbreaking performance in reasoning, math, and programming. See how it compares to OpenAI’s ChatGPT.

30 Nov 2024

openai gpt-o1
Comparing Reasoning Patterns: OpenAI’s o1 Model vs. Test-time Compute Methods

A study compares OpenAI's o1 model with methods like Best-of-N and Agent Workflow, showing how o1 excels in math and coding tasks through reasoning patterns like Divide and Conquer and Self-Refinement.

24 Oct 2024

claude anthropic
Anthropic Rolls Out Claude 3.5 Sonnet Upgrade, New Claude 3.5 Haiku, and Computer Control Feature

Anthropic rolls out the upgraded Claude 3.5 Sonnet, new Claude 3.5 Haiku, and introduces the revolutionary computer use feature in public beta.

23 Oct 2024

mistral
Mistral drops new AI models optimized for mobile devices

French startup Mistral releases Les Ministraux AI models for edge devices, offering compute-efficient, low-latency solutions with impressive performance in text and coding tasks.

18 Oct 2024

ARIA - New Open-Source AI Drops

Meet Aria, the first open-source multimodal Mixture-of-Experts LLM, with promising benchmarks in handling text, images, videos, and code. Independent testing will be crucial to confirm its potential.

17 Oct 2024

openai
People Keep Leaving OpenAI. What's Going On?

OpenAI faces new challenges as top talent leaves for competitors. This post explores how the loss of key personnel might affect OpenAI’s innovation and leadership in the AI industry.

7 Oct 2024

gpt-4 openai
Canvas: ChatGPT’s New Tool for Writing and Coding

ChatGPT’s new Canvas feature enhances AI collaboration for writing and coding tasks with a dedicated workspace, version control, and contextual understanding—making it easier to create and manage projects.

6 Oct 2024

Liquid AI’s new Liquid Foundation Models

Liquid AI’s new Liquid Foundation Models offer cutting-edge performance and memory efficiency, with benchmarks that surpass industry leaders. Learn about their unique architecture and upcoming event at MIT.

3 Oct 2024

llama meta
Llama 3.2 Drops

Llama 3.2 is out, introducing both small and medium vision models (11B and 90B) along with lightweight text-only models.

27 Sep 2024

mistral
Pixtral 12B: Mistral AI's First Multimodal AI Model

Meet Mistral AI's Pixtral 12B, a groundbreaking open-source multimodal AI model designed to excel in both text and image processing tasks with 12 billion parameters. Mistral AI launched its first multimodal AI model, Pixtral 12B, which can process both text and images.

25 Sep 2024

openai gpt-o1
GPT o1 Released. First Impressions and Feedback

OpenAI introduces a new series of AI models designed to spend more time thinking before they respond. They can reason through complex tasks and solve harder problems than previous models in science, coding, and math.

15 Sep 2024

openai gpt-4
Increase Performance and Accuracy with Fine-Tuned GPT-4o

New feature lets developers tweak the model’s behavior to better fit their app or organization’s needs.

21 Aug 2024

gpt-4 claude
AI's Shared Imagination: Study Reveals Surprising Similarities in AI Creativity

Recent research uncovers an unexpected 'shared imagination' among AI models, raising questions about the future of AI creativity and innovation. Here's the gist of that study and what it means for the evolution of artificial intelligence.

14 Aug 2024

mistral
Mistral Large 2 Released

Mistral AI has introduced Mistral Large 2, a model with a 128k context window and 123 billion parameters, designed for high-performance AI applications. It supports numerous languages and coding languages, aiming for efficient single-node inference.

29 Jul 2024

meta llama
Llama 3.1 Release and Open Source AI

Meta has released Llama 3.1, an open-source AI model comparable to frontier models. The company aims to make Llama the industry standard for open source AI.

24 Jul 2024

claude gpt-4
ZebraLogic is Testing LLMs with Logic Puzzles

ZebraLogic benchmark uses logic puzzles to assess how well large language models (LLMs) can reason logically. It aims to evaluate AI systems' ability to solve complex problems that require logical thinking.

23 Jul 2024

anthropic claude
Claude Android App Drops

Anthropic brings Claude, including its most advanced version called Claude 3.5 Sonnet, to Android devices.

19 Jul 2024

meta llama
Meta's AI Regulatory Challenges in Europe

Meta halts the release of its new AI models in the EU due to regulatory uncertainties, affecting the launch of the highly anticipated LLAMA 3 model.

19 Jul 2024

openai
Former OpenAI Staff Claim Company Used Illegal NDAs

Whistleblowers have accused OpenAI, the company behind ChatGPT, of using illegal non-disclosure agreements (NDAs) that prevented employees from speaking out about safety risks related to their AI technology.

14 Jul 2024

openai
Reasoners: GPT-5 Details Are Revealed

OpenAI believes it's close to achieving GPT-5 capable of human-level problem-solving at the level of PhD.

12 Jul 2024

RouteLLM: Cost-Effective AI Query Routing with Near GPT-4 Quality

RouteLLM is an open-source framework designed for cost-effective LLM routing, enabling high-quality AI performance at a significantly reduced cost.

9 Jul 2024

claude
Boom! Claude 3.5 Sonnet Is Out!

Claude 3.5 Sonnet has just dropped, and it's funnier, faster, and smarter than ever!

22 Jun 2024

lechat mistral
Mistral AI launches Mistral Large & Le Chat

Mistral AI, is making a significant leap forward by introducing Mistral Large, positioning itself as a contender among elite large-language models, and revealing a beta version of "Le Chat," its user-centric chatbot aimed at competing with the market dominator, Open AI's ChatGPT.

2 Mar 2024

anthropic claude
Claude 3: Anthropic's Latest AI Marvel Outshines Rivals

Anthropic, backed by companies like Google and Amazon, released Claude 3 as its latest flagship GenAI model, positioning it as a strong contender in the AI landscape

6 Mar 2024