AI日报汇

admin
16 1 月, 2026
0 Comments

NVIDIA AI Open-Sourced KVzap: A SOTA KV Cache Pruning Method that Delivers near-Lossless 2x-4x Compression

As context lengths move into t…

admin
16 1 月, 2026
0 Comments

How Nano Banana got its name

We’re peeling back the origin …

admin
15 1 月, 2026
0 Comments

Learners and educators are AI’s new “super users”

Google’s 2025 Our Life with AI…

admin
15 1 月, 2026
0 Comments

DeepSeek AI Researchers Introduce Engram: A Conditional Memory Axis For Sparse LLMs

Transformers use attention and…

admin
15 1 月, 2026
0 Comments

How to Build a Stateless, Secure, and Asynchronous MCP-Style Protocol for Scalable Agent Workflows

In this tutorial, we build a c…

admin
14 1 月, 2026
0 Comments

Introducing Community Benchmarks on Kaggle

Community Benchmarks on Kaggle…

admin
14 1 月, 2026
0 Comments

Announcing the winner of the Global AI Film Award

Over the past year, we’ve witn…

admin
14 1 月, 2026
0 Comments

Google AI Releases MedGemma-1.5: The Latest Update to their Open Medical AI Models for Developers

Google Research has expanded i…

admin
14 1 月, 2026
0 Comments

#489 – Paul Rosolie: Uncontacted Tribes in the Amazon Jungle

Paul Rosolie is a naturalist, …

admin
14 1 月, 2026
0 Comments

Veo 3.1 Ingredients to Video: More consistency, creativity and control

Today, we’re introducing an en…

admin
13 1 月, 2026
0 Comments

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce on Tuesday launched…

admin
13 1 月, 2026
0 Comments

Converge Bio raises $25M, backed by Bessemer and execs from Meta, OpenAI, Wiz

AI drug discovery startup Conv…

admin
13 1 月, 2026
0 Comments

Google AI Releases Universal Commerce Protocol (UCP): An Open-Source Standard Designed to Power the Next Generation of Agentic Commerce

Can AI shopping agents move be…

admin
13 1 月, 2026
0 Comments

How This Agentic Memory Research Unifies Long Term and Short Term Memory for LLM Agents

How do you design an LLM agent…

admin
12 1 月, 2026
0 Comments

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic released Cowork on M…

AI日报汇

A Coding Implementation on Qwen 3.6-35B-A3B Covering Multimodal Inference, Thinking Control, Tool Calling, MoE Routing, RAG, and Session Persistence

Moonshot AI Releases Kimi K2.6 with Long-Horizon Coding, Agent Swarm Scaling to 300 Sub-Agents and 4,000 Coordinated Steps

A Coding Implementation on Microsoft’s Phi-4-Mini for Quantized Inference Reasoning Tool Use RAG and LoRA Fine-Tuning

OpenAI Scales Trusted Access for Cyber Defense With GPT-5.4-Cyber: a Fine-Tuned Model Built for Verified Security Defenders

Moonshot AI and Tsinghua Researchers Propose PrfaaS: A Cross-Datacenter KVCache Architecture that Rethinks How LLMs are Served at Scale

Meet OpenMythos: An Open-Source PyTorch Reconstruction of Claude Mythos Where 770M Parameters Match a 1.3B Transformer

A Coding Implementation on Qwen 3.6-35B-A3B Covering Multimodal Inference, Thinking Control, Tool Calling, MoE Routing, RAG, and Session Persistence

Moonshot AI Releases Kimi K2.6 with Long-Horizon Coding, Agent Swarm Scaling to 300 Sub-Agents and 4,000 Coordinated Steps

A Coding Implementation on Microsoft’s Phi-4-Mini for Quantized Inference Reasoning Tool Use RAG and LoRA Fine-Tuning

A Coding Implementation on Qwen 3.6-35B-A3B Covering Multimodal Inference, Thinking Control, Tool Calling, MoE Routing, RAG, and Session Persistence

Moonshot AI Releases Kimi K2.6 with Long-Horizon Coding, Agent Swarm Scaling to 300 Sub-Agents and 4,000 Coordinated Steps

A Coding Implementation on Microsoft’s Phi-4-Mini for Quantized Inference Reasoning Tool Use RAG and LoRA Fine-Tuning

AI relative

NVIDIA AI Open-Sourced KVzap: A SOTA KV Cache Pruning Method that Delivers near-Lossless 2x-4x Compression

How Nano Banana got its name

Learners and educators are AI’s new “super users”

DeepSeek AI Researchers Introduce Engram: A Conditional Memory Axis For Sparse LLMs

How to Build a Stateless, Secure, and Asynchronous MCP-Style Protocol for Scalable Agent Workflows

Introducing Community Benchmarks on Kaggle

Announcing the winner of the Global AI Film Award

Google AI Releases MedGemma-1.5: The Latest Update to their Open Medical AI Models for Developers

#489 – Paul Rosolie: Uncontacted Tribes in the Amazon Jungle

Veo 3.1 Ingredients to Video: More consistency, creativity and control

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Converge Bio raises $25M, backed by Bessemer and execs from Meta, OpenAI, Wiz

Google AI Releases Universal Commerce Protocol (UCP): An Open-Source Standard Designed to Power the Next Generation of Agentic Commerce

How This Agentic Memory Research Unifies Long Term and Short Term Memory for LLM Agents

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Other Story

A Coding Implementation on Qwen 3.6-35B-A3B Covering Multimodal Inference, Thinking Control, Tool Calling, MoE Routing, RAG, and Session Persistence

Moonshot AI Releases Kimi K2.6 with Long-Horizon Coding, Agent Swarm Scaling to 300 Sub-Agents and 4,000 Coordinated Steps

A Coding Implementation on Microsoft’s Phi-4-Mini for Quantized Inference Reasoning Tool Use RAG and LoRA Fine-Tuning

OpenAI Scales Trusted Access for Cyber Defense With GPT-5.4-Cyber: a Fine-Tuned Model Built for Verified Security Defenders

Moonshot AI and Tsinghua Researchers Propose PrfaaS: A Cross-Datacenter KVCache Architecture that Rethinks How LLMs are Served at Scale

Meet OpenMythos: An Open-Source PyTorch Reconstruction of Claude Mythos Where 770M Parameters Match a 1.3B Transformer