2026 年 3 月 – 第 2 页

admin
27 3 月, 2026
0 Comments

A Coding Implementation to Run Qwen3.5 Reasoning Models Distilled with Claude-Style Thinking Using GGUF and 4-Bit Quantization

In this tutorial, we work dire…

admin
27 3 月, 2026
0 Comments

Watch James Manyika talk AI and creativity with LL COOL J.

In the latest episode of our D…

admin
27 3 月, 2026
0 Comments

Transform your headphones into a live personal translator on iOS.

Google Translate’s Live transl…

admin
26 3 月, 2026
0 Comments

Tencent AI Open Sources Covo-Audio: A 7B Speech Language Model and Inference Pipeline for Real-Time Audio Conversations and Reasoning

Tencent AI Lab has released Co…

admin
26 3 月, 2026
0 Comments

How to Build a Vision-Guided Web AI Agent with MolmoWeb-4B Using Multimodal Reasoning and Action Prediction

In this tutorial, we explore M…

admin
26 3 月, 2026
0 Comments

Lyria 3 Pro: Create longer tracks in more Google products

We are bringing Lyria 3 to the…

admin
26 3 月, 2026
0 Comments

Build with Lyria 3, our newest music generation model

Lyria 3 is now available in pa…

admin
25 3 月, 2026
0 Comments

NVIDIA AI Introduces PivotRL: A New AI Framework Achieving High Agentic Accuracy With 4x Fewer Rollout Turns Efficiently

Post-training Large Language M…

admin
25 3 月, 2026
0 Comments

Google Introduces TurboQuant: A New Compression Algorithm that Reduces LLM Key-Value Cache Memory by 6x and Delivers Up to 8x Speedup, All with Zero Accuracy Loss

The scaling of Large Language …

admin
25 3 月, 2026
0 Comments

Paged Attention in Large Language Models LLMs

When running LLMs at scale, th…

admin
25 3 月, 2026
0 Comments

A Coding Implementation to Design Self-Evolving Skill Engine with OpenSpace for Skill Learning, Token Efficiency, and Collective Intelligence

In this tutorial, we explore O…

admin
25 3 月, 2026
0 Comments

This AI Paper Introduces TinyLoRA, A 13-Parameter Fine-Tuning Method That Reaches 91.8 Percent GSM8K on Qwen2.5-7B

Researchers from FAIR at Meta,…

admin
24 3 月, 2026
0 Comments

Yann LeCun’s New LeWorldModel (LeWM) Research Targets JEPA Collapse in Pixel-Based Predictive World Modeling

World Models (WMs) are a centr…

admin
24 3 月, 2026
0 Comments

Meta AI’s New Hyperagents Don’t Just Solve Tasks—They Rewrite the Rules of How They Learn

The dream of recursive self-im…

admin
24 3 月, 2026
0 Comments

Luma Labs Launches Uni-1: The Autoregressive Transformer Model that Reasons through Intentions Before Generating Images

In the field of generative AI …

Other Story

AI relative

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

admin
2 6 月, 2026

AI relative

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

admin
2 6 月, 2026

AI relative

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

admin
2 6 月, 2026

AI relative

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

admin
2 6 月, 2026

AI relative

Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Top of Hermes Agent

admin
2 6 月, 2026

AI relative

How we used Gemini to build Google I/O 2026

admin
2 6 月, 2026

AI日报汇

AI日报汇

Archives 3 月 2026

A Coding Implementation to Run Qwen3.5 Reasoning Models Distilled with Claude-Style Thinking Using GGUF and 4-Bit Quantization

Watch James Manyika talk AI and creativity with LL COOL J.

Transform your headphones into a live personal translator on iOS.

Tencent AI Open Sources Covo-Audio: A 7B Speech Language Model and Inference Pipeline for Real-Time Audio Conversations and Reasoning

How to Build a Vision-Guided Web AI Agent with MolmoWeb-4B Using Multimodal Reasoning and Action Prediction

Lyria 3 Pro: Create longer tracks in more Google products

Build with Lyria 3, our newest music generation model

NVIDIA AI Introduces PivotRL: A New AI Framework Achieving High Agentic Accuracy With 4x Fewer Rollout Turns Efficiently

Google Introduces TurboQuant: A New Compression Algorithm that Reduces LLM Key-Value Cache Memory by 6x and Delivers Up to 8x Speedup, All with Zero Accuracy Loss

Paged Attention in Large Language Models LLMs

A Coding Implementation to Design Self-Evolving Skill Engine with OpenSpace for Skill Learning, Token Efficiency, and Collective Intelligence

This AI Paper Introduces TinyLoRA, A 13-Parameter Fine-Tuning Method That Reaches 91.8 Percent GSM8K on Qwen2.5-7B

Yann LeCun’s New LeWorldModel (LeWM) Research Targets JEPA Collapse in Pixel-Based Predictive World Modeling

Meta AI’s New Hyperagents Don’t Just Solve Tasks—They Rewrite the Rules of How They Learn

Luma Labs Launches Uni-1: The Autoregressive Transformer Model that Reasons through Intentions Before Generating Images

Other Story

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Top of Hermes Agent

How we used Gemini to build Google I/O 2026