AI HN: AI news from Hacker News
37 · "o1"
1
The quality of AI-assisted software depends on unit of work management(blog.nilenso.com)
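170 · mogambo1 · 3 months ago · 123 comments
This article explores why managing appropriately sized units of work matters in AI-assisted software development: it makes better use of the context window, which improves the correctness and quality of generated code and reduces error propagation. It cites insights from Andrej Karpathy, Anthropic's context window visualization, and Drew Breunig's research on context problems.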
2
Being “Confidently Wrong” is holding AI back(promptql.io)
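155 · tango12 · 4 months ago · 262 comments
AI Safety
The "confidently wrong" problem in AI systems, that is, presenting incorrect information with a high degree of certainty, is a key obstacle to their further development and to building user trust.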
3
We may not like what we become if A.I. solves loneliness(newyorker.com)
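506 · defo10 · 5 months ago · 1034 comments
AI Safety
This story voices a concern: if artificial intelligence is used to solve loneliness, humans may change in undesirable ways that we would not want to accept.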
4
6 weeks of Claude Code(blog.puzzmo.com)
581 · mike1o1 · 5 months ago · 590 comments
Anthropic & Claude · Code & Development
This story covers six weeks of Claude Code, Anthropic's AI coding tool.
5
Fakespot shuts down today after 9 years of detecting fake product reviews(blog.truestar.pro)
416 · doppio19 · 6 months ago · 273 comments
Fakespot, which detected fake product reviews for 9 years, shuts down today.
6
The Claude Bliss Attractor(astralcodexten.com)
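58 · Michelangelo11 · 7 months ago · 1 comment
Anthropic & Claude · Code & Development
This story may center on Anthropic's Claude AI system and the concept of a "Bliss Attractor", exploring how this chaos-theory principle might inform the design of stable, optimized AI performance or outcomes.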
7
Local LLM inference – impressive but too hard to work with(medium.com)
84 · aazo11 · 8 months ago · 58 comments
Inference Optimization · LLM Research
💡 The story focuses on local LLM inference, which is explicitly listed under the infra category (deployment and inference of AI models).
8
OpenAI's o1-pro now available via API(platform.openai.com)
131 · davidbarker · 9 months ago · 129 comments
OpenAI Ecosystem · AI Reasoning
💡 The story announces OpenAI's o1-pro model is now available via API, which falls under model releases/update category.
9
Intel appoints Lip-Bu Tan as its CEO(reuters.com)
286 · yoyoyo1122 · 10 months ago · 143 comments
💡 The story covers Intel appointing a new CEO, which is a leadership change falling under the 'company' category (business & corporate news). Intel is an AI-related company due to its AI chip offerings (e.g., Gaudi), making this story part of the AI ecosystem.
10
Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue”(openpipe.ai)
199 · kcorbitt · 10 months ago · 55 comments
OpenAI Ecosystem · AI Reasoning
💡 The story focuses on using the GRPO method to outperform models like o1, o3-mini, and R1 on the 'Temporal Clue' task, which aligns with the research category (algorithms and task performance improvements).
11
Show HN: Firebender, a simple coding agent for Android Engineers(docs.firebender.com)
53 · kevo1ution · 10 months ago · 18 comments
💡 The story presents Firebender, an autonomous coding agent for Android Engineers that performs tasks like writing tests autonomously, which falls under the agents category focusing on agentic workflows.
12
Making o1, o3, and Sonnet 3.7 hallucinate for everyone(bengarcia.dev)
267 · hahahacorn · 10 months ago · 219 comments
AI Safety · Anthropic & Claude
💡 The story discusses inducing hallucinations in AI models (o1, o3, Sonnet 3.7), which falls under AI safety, specifically red teaming to test model vulnerabilities and robustness against hallucination issues.
13
Train Your Own O1 Preview Model Within $450(sky.cs.berkeley.edu)
429 · 9woc · 10 months ago · 69 comments
OpenAI Ecosystem · AI Reasoning
💡 The story centers on training a version of the O1 Preview model at a low cost ($450), which aligns with the infra category's focus on AI model training and cost-efficient compute practices.
14
DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL(pretty-radio-b75.notion.site)
322 · sijuntan · 11 months ago · 127 comments
OpenAI Ecosystem · AI Reasoning
💡 The story centers on the release of DeepScaleR, a 1.5B model that surpasses O1-Preview via scaled reinforcement learning, fitting the 'models' category which includes model releases and comparisons.
15
CAPTCHAs: 'a tracking cookie farm for profit masquerading as a security service'(pcgamer.com)
198 · ghuroo1 · 11 months ago · 133 comments
AI Safety · Google AI
💡 The story discusses a study's critical conclusion about CAPTCHAs (AI-related systems used for data collection and security) being a profit-driven tracking farm, focusing on their societal impact (billions of user hours spent) and ethical implications—core aspects of the 'society' category.
16
Show HN: NoSQL, but it's SQLite(gist.github.com)
98 · vsroy · about 1 year ago · 39 comments
OpenAI Ecosystem · AI Reasoning
💡 The story presents a development tool (NoSQL-like interface for SQLite) built using OpenAI's o1 model, which aligns with the 'tools' category under Engineering.
17
The GPT era is already ending(theatlantic.com)
55 · bergie · about 1 year ago · 28 comments
OpenAI Ecosystem · AI Reasoning
💡 The story focuses on OpenAI's o1 reasoning model, which is a model release/update falling under the Models category.
18
I spent 8 hours testing o1 Pro ($200) vs. Claude Sonnet 3.5 ($20)(old.reddit.com)
53 · miles · about 1 year ago · 0 comments
Anthropic & Claude · Code & Development
💡 The story focuses on testing and comparing two AI models (o1 Pro and Claude Sonnet 3.5), which directly aligns with the 'models' category that includes model comparisons.
19
OpenAI o1 system card(openai.com)
417 · meetpateltech · about 1 year ago · 302 comments
OpenAI Ecosystem · AI Reasoning
💡 The story focuses on OpenAI's o1 system card, which involves model details and capabilities—core content of the models category.
20
Alibaba releases an 'open' challenger to OpenAI's O1 reasoning model(techcrunch.com)
121 · bn-l · about 1 year ago · 6 comments
OpenAI Ecosystem · AI Reasoning
💡 The story focuses on Alibaba releasing a new AI model that competes with OpenAI's O1 reasoning model, which falls under the 'models' category as per the rule that model releases are classified as 'models' regardless of the company origin.
21
QwQ: Alibaba's O1-like reasoning LLM(qwenlm.github.io)
438 · amrrs · about 1 year ago · 421 comments
OpenAI Ecosystem · AI Reasoning
💡 The story announces Alibaba's QwQ, an O1-like reasoning LLM, which falls under model releases as specified in the 'models' category.
22
LLaVA-O1: Let Vision Language Models Reason Step-by-Step(arxiv.org)
177 · lnyan · about 1 year ago · 32 comments
💡 The story is an ArXiv paper about LLaVA-O1, a vision language model focused on step-by-step reasoning, which aligns with the research category (academic papers and reasoning research).
23
Support for Claude Sonnet 3.5, OpenAI O1 and Gemini 1.5 Pro(qodo.ai)
69 · benocodes · about 1 year ago · 35 comments
Anthropic & Claude · Code & Development
💡 The story announces Qodo adding support for Claude Sonnet 3.5, OpenAI O1, and Gemini 1.5 Pro. This is an update to a development tool integrating new AI models, fitting the 'tools' category.
24
Show HN: Steiner – An open-source reasoning model inspired by OpenAI o1(medium.com)
83 · peakji · about 1 year ago · 19 comments
OpenAI Ecosystem · AI Reasoning
💡 The story is a Show HN about the release of Steiner, an open-source reasoning model, which falls under the Models category as it involves a new model release.
25
Qualcomm Wants to Buy Intel(theverge.com)
189 · oco101 · over 1 year ago · 100 comments
💡 The story covers Qualcomm's potential acquisition of Intel, a business and corporate news event involving key players in the AI hardware ecosystem.
26
g1: Using Llama-3.1 70B on Groq to create o1-like reasoning chains(github.com)
334 · gfortaine · over 1 year ago · 148 comments
AI Chips · OpenAI Ecosystem
💡 The story centers on using Llama-3.1 (a model) to create o1-like reasoning chains, which aligns with the 'model capabilities' subcategory of the Models category.
27
Terence Tao on O1(mathstodon.xyz)
664 · dselsam · over 1 year ago · 482 comments
OpenAI Ecosystem · AI Reasoning
💡 The story involves mathematician Terence Tao sharing his perspectives on OpenAI's O1 model, which falls under social discussion and expert opinions on AI, fitting the 'society' category.
28
OpenAI o1 Results on ARC-AGI-Pub(arcprize.org)
187 · z7 · over 1 year ago · 118 comments
OpenAI Ecosystem · AI Reasoning
💡 The story focuses on the OpenAI o1 model's performance results on the ARC-AGI-Pub benchmark, which falls under the 'models' category (includes benchmark results and model capabilities).
29
OpenAI threatens to revoke o1 access for asking it about its chain of thought(twitter.com)
538 · jsheard · over 1 year ago · 305 comments
AI Safety · OpenAI Ecosystem
💡 The story involves OpenAI threatening to revoke access to its o1 model for users asking about its chain of thought, which relates to mechanistic interpretability—a subtopic of AI safety.
30
Notes on OpenAI's new o1 chain-of-thought models(simonwillison.net)
699 · loganfrederick · over 1 year ago · 629 comments
OpenAI Ecosystem · AI Reasoning
💡 The story discusses OpenAI's new o1 chain-of-thought models, which falls under model releases—one of the key subcategories of the 'models' category.
Page 1 of 2, 37 items in total
Hacker News | Powered by Doubao