Historical record of major AI breakthroughs and milestones
2025 (4 developments)
Image Generation
Google's 'Nano Banana' AI Image Editor Launches in Gemini 2.5 Flash
Google releases Gemini 2.5 Flash Image (codenamed 'Nano Banana'), an AI image editor that excels at maintaining character consistency while enabling natural language transformations and multi-image blending. Available via the Gemini API at $0.04 per image.
OpenAI unveils GPT-5, which scores 98.2% on HumanEval, passing the 98% mark, and 99.1% on GSM8K. The results point to strong capabilities in code generation and mathematical reasoning.
Claude Opus 4.1: Anthropic's Latest Flagship Model
Anthropic releases Claude Opus 4.1 with breakthrough performance, achieving 97.5% on HumanEval and 98.7% on GSM8K. Features enhanced reasoning capabilities and improved long-context understanding.
Genie 3: Google DeepMind's Interactive World Model
DeepMind unveils Genie 3, a foundation world model that generates real-time interactive environments at 24 fps from text prompts. The model generates frames auto-regressively and maintains physical consistency without an explicit 3D representation, and is presented as a stepping stone toward AGI.
Gemini 2.0: Native Tool Use and Agentic Capabilities
Google releases Gemini 2.0 with native tool use capabilities, enabling AI agents to interact with external systems and perform complex multi-step tasks autonomously.
Anthropic introduces computer use capabilities, allowing Claude to control computers through screenshots, opening new possibilities for automation and accessibility.
OpenAI introduces GPT-4o with native multimodal capabilities, processing text, vision, and audio in a unified architecture for more natural interactions.
AI Index 2024: AI Beats Humans on Several Benchmarks
Stanford HAI's AI Index reveals AI now surpasses human performance on several benchmarks including image classification, visual reasoning, and English understanding, but still lags in complex tasks like math and planning.
Anthropic releases the Claude 3 model family with vision capabilities, offering different sizes for various use cases from edge devices to complex reasoning tasks.