这个页面面向 “AI trends”“LLM trends”“AI model trends” 这类更宽泛的搜索意图,把新闻、发版速度、榜单变动与近期新模型发布整合到一个真实数据入口。
365
已追踪模型
53
45 天内发布
8
近期活跃服务商
290
新闻文章缓存
-3
周度模型变化
统计最近 45 天内有真实发布时间的模型,观察哪些服务商在加速发布。
最近发布:Gemini 3.1 Flash Lite · May 7, 2026
Alibaba
6最近发布:Qwen3.5 Plus 2026-04-20 · Apr 27, 2026
OpenAI
4最近发布:GPT Chat Latest · May 5, 2026
Anthropic
3最近发布:Claude Opus 4.7 (Fast) · May 12, 2026
inclusionai
3最近发布:Ring-2.6-1T (free) · May 8, 2026
xAI
3最近发布:Grok 4.3 · Apr 30, 2026
~anthropic
3最近发布:Anthropic Claude Haiku Latest · Apr 27, 2026
DeepSeek
3最近发布:DeepSeek V4 Pro · Apr 24, 2026
Anthropic launches Claude for Small Business to embed AI into the tools you forgot you pay for
The Decoder · May 13, 2026
Anthropic is launching "Claude for Small Business," a package of 15 agent-based workflows and integrations for tools like QuickBooks, PayPal, and HubSpot. The company is also rolling out free training courses and a work…
From Prompt to Pointer Engineering: Deepmind tries to reinvent the mouse cursor for the AI era
The Decoder · May 13, 2026
Pointer Engineering: Deepmind wants to turn the mouse cursor into the key variable in context engineering. The article From Prompt to Pointer Engineering: Deepmind tries to reinvent the mouse cursor for the AI era appea…
Rethinking Evaluation for LLM Hallucination Detection: A Desiderata, A New RAG-based Benchmark, New Insights
arXiv cs.AI · May 13, 2026
arXiv:2605.11330v1 Announce Type: new Abstract: Hallucination, broadly referring to unfaithful, fabricated, or inconsistent content generated by LLMs, has wide-ranging implications. Therefore, a large body of effort has…
Counterfactual Trace Auditing of LLM Agent Skills
arXiv cs.AI · May 13, 2026
arXiv:2605.11946v1 Announce Type: new Abstract: Large Language Model agents are increasingly augmented with agent skills. Current evaluation methods for skills remain limited. Most deployed benchmarks report only pass r…
IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection
arXiv cs.AI · May 13, 2026
arXiv:2605.11868v1 Announce Type: cross Abstract: Web-browsing AI agents are increasingly deployed in enterprise settings under strict whitelists of approved domains, yet adversaries can still influence them by embeddin…
Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving
arXiv cs.AI · May 13, 2026
arXiv:2605.11905v1 Announce Type: new Abstract: Automated theorem proving with large language models in Lean 4 is commonly approached through either step-level tactic prediction with tree search or whole-proof generatio…
Seir\^enes: Adversarial Self-Play with Evolving Distractions for LLM Reasoning
arXiv cs.AI · May 13, 2026
arXiv:2605.11636v1 Announce Type: new Abstract: We present Seir\^enes, a self-play RL framework that transforms contextual interference from a failure mode of LLM reasoning into an internal training signal for co-evolvi…
Hierarchical Multi-Scale Graph Neural Networks: Scalable Heterophilous Learning with Oversmoothing and Oversquashing Mitigation
arXiv cs.AI · May 13, 2026
arXiv:2605.10975v1 Announce Type: cross Abstract: Graphs with heterophily, where adjacent nodes carry different labels, are prevalent in real-world applications, from social networks to molecular interactions. However,…
OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents
arXiv cs.AI · May 13, 2026
arXiv:2605.11169v1 Announce Type: new Abstract: Large language model agents interleave reasoning, action selection, and observation to solve sequential decision-making tasks. In deployed settings where agents repeatedly…
PIVOT: Bridging Planning and Execution in LLM Agents via Trajectory Refinement
arXiv cs.AI · May 13, 2026
arXiv:2605.11225v1 Announce Type: new Abstract: Large language model (LLM)-based agents frequently generate seemingly coherent plans that fail upon execution due to infeasible actions, constraint violations, and compoun…
Hugging Face
106最新文章:May 6, 2026
arXiv cs.AI
104最新文章:May 13, 2026
OpenAI
27最新文章:May 7, 2026
arXiv cs.CL
25最新文章:May 13, 2026
arXiv cs.LG
10最新文章:May 13, 2026
VentureBeat
4最新文章:Jan 19, 2026
最新周快照模型数
367
周度服务商变化
0
最新快照区间
May 4, 2026 - May 10, 2026
这个页面把近期模型发布、真实排名变动、服务商发版速度和新闻信号放在一起,形成一个更宽的 AI 趋势入口,而不只是单一的排行榜页面。
不是。排名趋势来自当前实时目录与最近一次归档周快照的对比;新闻来自缓存的来源文章;发布速度来自已追踪模型的真实发布时间。
AI 新闻页聚焦文章流和归档,而这个趋势页聚焦市场结构变化,例如谁在加速发版、哪些模型在上升、哪些服务商最近更活跃。