Open Source AI - AI HN

AI HN: AI news from Hacker News

Open Source AI

317 items

Cursor Acquires Graphite(graphite.com)

11 points · timvdalen · 2 days ago · 3 comments

💡 The story announces the acquisition of Graphite (a code review platform) by Cursor (an AI IDE), which falls under business and corporate news.

Graphite Is Joining Cursor(cursor.com)

66 points · fosterfriends · 2 days ago · 23 comments

💡 The story announces Cursor's acquisition of Graphite, a code review platform, which falls under business and corporate news in the 'company' category.

Qwen-Image-Layered: transparency and layer aware open diffusion model(huggingface.co)

55 points · dvrp · 3 days ago · 7 comments

💡 This is an academic paper presenting Qwen-Image-Layered, a new diffusion model for image layer decomposition, which aligns with the Research category's focus on academic papers and algorithms.

T5Gemma 2: The next generation of encoder-decoder models(blog.google)

141 points · milomg · 3 days ago · 26 comments

💡 The story centers on the release of T5Gemma 2, a new encoder-decoder model from Google, which aligns with the 'models' category as model releases are explicitly classified here per the rules.

FunctionGemma 270M Model(blog.google)

211 points · mariobm · 3 days ago · 54 comments

💡 The story is about the release of the FunctionGemma 270M model, announced on Google's developer blog, which falls under the Models category (model releases).

Mistral OCR 3(mistral.ai)

53 points · pember · 4 days ago · 0 comments

💡 The story announces the release of Mistral OCR 3, an AI model for document processing, which aligns with the 'models' category covering model releases and updates.

Prompt caching for cheaper LLM tokens(ngrok.com)

207 points · samwho · 5 days ago · 47 comments

💡 Prompt caching is an optimization technique for LLM inference to reduce token costs, which falls under the 'infra' category covering deployment and inference-related topics.

DeepSeek uses banned Nvidia chips for AI model, report says(finance.yahoo.com)

329 points · goodway · 11 days ago · 316 comments

💡 The story centers on Chinese AI startup DeepSeek using smuggled Nvidia Blackwell chips (a key AI hardware component banned in China by U.S. regulations) to develop an upcoming AI model, aligning with the Hardware category's focus on AI chips and compute.

Qwen3-Omni-Flash-2025-12-01: a next-generation native multimodal large model(qwen.ai)

314 points · pretext · 11 days ago · 106 comments

💡 The story focuses on the release of Qwen3-Omni-Flash-2025-12-01, a next-generation native multimodal large model, which directly falls into the 'models' category as per the rule that model releases belong to 'models' regardless of company affiliation.

Mistral releases Devstral2 and Mistral Vibe CLI(mistral.ai)

745 points · pember · 13 days ago · 349 comments

💡 The story centers on Mistral AI's release of Devstral 2, a coding model family with benchmark results (SWE-bench Verified) and competitor comparisons, which aligns with the Models category per Rule 3 (model releases → models regardless of company origin). The Mistral Vibe CLI is an accompanying tool, but the core announcement focuses on the model release.

Zebra-Llama – Towards efficient hybrid models(arxiv.org)

113 points · mirrir · 15 days ago · 61 comments

💡 The story focuses on the release of Zebra-Llama, a new hybrid language model family with 1B/3B/8B variants, including new architecture elements (SSM and MLA combination) and performance comparisons—all fitting the Models category criteria.

Prompt Injection via Poetry(wired.com)

90 points · bumbailiff · 18 days ago · 35 comments

💡 The article focuses on prompt injection via poetry to bypass AI safeguards for nuclear weapon-related assistance, which directly falls under AI safety (security and red teaming).

Launch HN: Phind 3 (YC S22) – Every answer is a mini-app

138 points · rushingcreek · 18 days ago · 94 comments

💡 Phind 3 is a versioned launch of an AI answer engine, which qualifies as a model release under the category rules (model releases → models, even if from a company).

Mistral 3 family of models released(mistral.ai)

826 points · pember · 20 days ago · 236 comments

💡 The story is an announcement of the Mistral 3 family of AI models, including small dense models and Mistral Large 3 (a mixture-of-experts model), which directly falls under the models category for model releases.

DeepSeek-v3.2: Pushing the frontier of open large language models [pdf](huggingface.co)

982 points · pretext · 20 days ago · 465 comments

💡 The story focuses on the release of DeepSeek-v3.2, an open large language model, which directly falls under the models category (model releases are explicitly listed in this category).

DeepSeek-v3.2(huggingface.co)

63 points · meetpateltech · 21 days ago · 1 comment

💡 The story centers on the release of DeepSeek-V3.2, an AI model, with details on its capabilities (efficient reasoning, agent performance), technical innovations (DSA attention mechanism, scalable RL framework), and benchmark achievements (surpassing a GPT-5 variant, IMO/IOI gold medals), fitting the 'models' category for releases and capabilities.

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning(huggingface.co)

264 points · victorbuilds · 21 days ago · 88 comments

💡 The story focuses on the release of DeepSeekMath-V2, a model for self-verifiable mathematical reasoning, which aligns with the 'models' category (model releases).

Program-of-Thought Prompting Outperforms Chain-of-Thought by 15% (2022)(arxiv.org)

136 points · mkagenius · 21 days ago · 36 comments

💡 The story is about an arXiv paper proposing Program of Thoughts (PoT) prompting, academic research on numerical reasoning methods for language models, which falls under the 'research' category as per the classification rules.

Qwen3-VL can scan two-hour videos and pinpoint nearly every detail(the-decoder.com)

265 points · thm · 22 days ago · 82 comments

💡 The story focuses on the ability of Qwen3-VL, an AI model, to scan two-hour videos and pinpoint details, which falls under model releases/updates.

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf](github.com)

231 points · fspeech · 24 days ago · 50 comments

💡 The story links to a PDF paper about DeepSeekMath-V2, focusing on self-verifiable mathematical reasoning—this falls under the research category as it involves reasoning research (a key subpoint of the research category).

Show HN: Runprompt – run .prompt files from the command line(github.com)

134 points · chr15m · 25 days ago · 49 comments

💡 The story introduces Runprompt, a command-line tool for executing .prompt files, which falls under AI development tools (prompt engineering tools).

Google Antigravity exfiltrates data via indirect prompt injection attack(promptarmor.com)

768 points · jjmaxwell4 · 26 days ago · 215 comments

💡 The story details an indirect prompt injection attack on Google Antigravity (an AI agentic code editor) leading to data exfiltration, which falls under the 'safety' category covering AI security issues like prompt injection and red teaming.

73% of AI startups are just prompt engineering(pub.towardsai.net)

246 points · kllrnohj · 28 days ago · 205 comments

💡 The article focuses on the business practices of AI startups, revealing that most rely on third-party APIs instead of proprietary tech, which falls under business & corporate news in the ecosystem category.

Measuring the impact of AI scams on the elderly(simonlermen.substack.com)

101 points · DalasNoin · about 1 month ago · 42 comments

💡 The story focuses on AI model jailbreaking for phishing scams and measuring their impact on the elderly, which aligns with AI security and safety issues including jailbreaks.

Continuous Autoregressive Language Models(arxiv.org)

115 points · Anon84 · about 2 months ago · 10 comments

💡 The story focuses on an arXiv paper introducing CALM, a new language model architecture that shifts from discrete token prediction to continuous vector prediction, which falls under the Models category because that category includes new architecture releases such as Mamba or SSMs.

Google pulls AI model after senator says it fabricated assault allegation(theverge.com)

84 points · croemer · about 2 months ago · 93 comments

💡 The story involves Google (a company) removing its AI model Gemma after it fabricated an assault allegation, which falls under corporate news related to the company's AI product management.

New prompt injection papers: Agents rule of two and the attacker moves second(simonwillison.net)

114 points · simonw · about 2 months ago · 44 comments

💡 The article focuses on research about prompt injection and AI agent security, which directly falls under the safety category (includes prompt injection and AI security topics).

Tongyi DeepResearch – open-source 30B MoE Model that rivals OpenAI DeepResearch(tongyi-agent.github.io)

365 points · meander_water · about 2 months ago · 153 comments

💡 The story focuses on Tongyi DeepResearch, an open-source autonomous Web Agent with agentic features (ReAct mode, RL training pipeline) and benchmark performance, fitting the 'agents' category.

Karpathy on DeepSeek-OCR paper: Are pixels better inputs to LLMs than text?(twitter.com)

410 points · JnBrymn · 2 months ago · 173 comments

💡 The story involves Karpathy's discussion of the DeepSeek-OCR academic paper, which explores whether pixels are better inputs for LLMs than text—this aligns with the research category (academic papers and algorithmic explorations).

Getting DeepSeek-OCR working on an Nvidia Spark via brute force with Claude Code(simonwillison.net)

201 points · simonw · 2 months ago · 45 comments

💡 The article focuses on using Claude Code (an AI coding assistant) to perform AI-assisted programming tasks, including setting up the DeepSeek-OCR model environment, generating scripts, and automating dependency installation, which aligns with the 'coding' category of AI-assisted programming.

...

Page 1 of 11, 317 items in total