AI日报汇 – 第 2 页

AI relative

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

By admin
2 6 月, 2026
0 Comments
1 views

AI relative

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

By admin
2 6 月, 2026
0 Comments
1 views

AI relative

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

By admin
2 6 月, 2026
0 Comments
1 views

AI relative

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

By admin
2 6 月, 2026
0 Comments
1 views

AI relative

Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Top of Hermes Agent

By admin
2 6 月, 2026
0 Comments
1 views

AI relative

How we used Gemini to build Google I/O 2026

By admin
2 6 月, 2026
0 Comments
1 views

AI relative

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

2 6 月, 2026

AI relative

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

2 6 月, 2026

AI relative

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

2 6 月, 2026

AI relative

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

2 6 月, 2026

AI relative

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

2 6 月, 2026

AI relative

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

2 6 月, 2026

AI relative

658

admin
30 5 月, 2026
0 Comments

NVIDIA Introduces X-Token: Projection-Guided Cross-Tokenizer KD That Outperforms GOLD by +3.82 Average Points on Llama-3.2-1B

Knowledge distillation (KD) tr…

admin
30 5 月, 2026
0 Comments

StepFun Releases Step 3.7 Flash: A 198B MoE Vision-Language Model for Coding Agents and Search Workflows

StepFun today released Step 3.…

admin
29 5 月, 2026
0 Comments

Check out real-life AI prototypes from the Futures Lab.

University of Waterloo student…

admin
29 5 月, 2026
0 Comments

Meet mKernel: A Multi-GPU, Multi-Node Fused Kernel Library for GPU-Driven Communication

GPU communication overhead is …

admin
29 5 月, 2026
0 Comments

Hexo Labs Open-Sources SIA: A Self-Improving Agent That Updates Both the Harness and the Model Weights

Most AI agents stop improving …

admin
29 5 月, 2026
0 Comments

How to Design an End-to-End Ansible Automation Lab with Playbooks, Inventories, Roles, Vault, Dynamic Inventory, and Custom Modules

In this tutorial, we build a c…

admin
29 5 月, 2026
0 Comments

Liquid AI Releases LFM2.5-8B-A1B: An On-Device MoE Model With 8.3B Total and 1.5B Active Parameters

Liquid AI just shipped LFM2.5-…

admin
28 5 月, 2026
0 Comments

Perplexity AI Open-Sources Unigram Tokenizer That Achieves 5x Lower p50 Latency Than Hugging Face tokenizers Crate

Perplexity AI’s research team …

admin
28 5 月, 2026
0 Comments

A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System

In this tutorial, we build a c…

admin
28 5 月, 2026
0 Comments

Sakana AI Proposes DiffusionBlocks: a Block-wise Training Framework That Converts Residual Networks into Independently Trainable Denoising Modules

Researchers from Sakana AI and…

admin
28 5 月, 2026
0 Comments

NVIDIA Releases Polar, a Token-Faithful Rollout Framework for GRPO Training Across Codex, Claude Code, and Qwen Code

Reinforcement learning for lan…

admin
27 5 月, 2026
0 Comments

Meet EAGLE 3.1: The Speculative Decoding Algorithm That Fixes Attention Drift in LLM Inference

Speculative decoding is a tech…

admin
27 5 月, 2026
0 Comments

MEMO: A Modular Framework for Training a Dedicated Memory Model on New Knowledge Without Modifying LLM Parameters

Large language models become s…

admin
27 5 月, 2026
0 Comments

Design a High-Precision Retrieve-and-Rerank Pipeline with ZeroEntropy Zerank-2 Reranker

In this tutorial, we use zeroe…

admin
27 5 月, 2026
0 Comments

Stability AI Releases Stable Audio 3: A Family of Fast Latent Diffusion Models for Audio Generation and Editing

Stability AI has released open…

Other Story

AI relative

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

admin
2 6 月, 2026

AI relative

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

admin
2 6 月, 2026

AI relative

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

admin
2 6 月, 2026

AI relative

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

admin
2 6 月, 2026

AI relative

Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Top of Hermes Agent

admin
2 6 月, 2026

AI relative

How we used Gemini to build Google I/O 2026

admin
2 6 月, 2026