AI HN来自 Hacker News 的 AI 新闻
EN
今天
3天
7天
30天
全部
13 · "LlamaIndex"
每页
1
Production RAG: what I learned from processing 5M+ documents(blog.abdellatif.io)
551 ·tifa2up·2 个月前·114 评论
RAG & Retrieval
本文分享了为两家处理超500万文档的企业构建生产级RAG系统的经验,包括查询生成、重排序、分块、元数据使用和查询路由等关键策略。文章详细介绍了所用技术栈(向量数据库、分块工具、嵌入模型、重排序器),并提到基于这些经验的开源项目Agentset。
2
The Tiny Teams Playbook(latent.space)
137 ·tilt·3 个月前·46 评论
AI AgentInference Optimization
这篇来自Latent Space的文章包含关于AI案例研究(LLM系统设计、智能体、RAG)的评论,以及相关帖子如《2025 AI工程师阅读清单》(涵盖工具、模型、工作流程)和关于o1等模型的讨论。
3
How to Fix Your Context(dbreunig.com)
93 ·itzlambda·4 个月前·26 评论
RAG & Retrieval
这篇人工智能相关的新闻探讨了解决AI系统中上下文管理挑战的有效方法,尤其是提升上下文保留的准确性和相关性,以优化对话式AI等应用的性能。
4
AI assisted search-based research works now(simonwillison.net)
283 ·simonw·8 个月前·147 评论
RAG & Retrieval
💡 The story discusses AI-assisted search-based research, which likely involves tools for retrieval-augmented generation (RAG) or search frameworks (e.g., LlamaIndex-style systems), fitting the 'tools' category under Engineering.
5
That's a Lot of YAML(noyaml.com)
82 ·l0b0·9 个月前·55 评论
RAG & Retrieval
💡 The story likely addresses YAML usage in AI development workflows or tools (common in frameworks like LangChain/LlamaIndex), fitting the tools category for development tools & frameworks.
6
Ask HN: What are you using to parse PDFs for RAG?
163 ·carlbren·超过 1 年前·94 评论
RAG & Retrieval
💡 The story inquires about AI development tools (LangChain, LlamaIndex) for PDF parsing in the context of RAG, which aligns with the tools category under Engineering.
7
Solving the out-of-context chunk problem for RAG(d-star.ai)
260 ·zmccormick7·超过 1 年前·89 评论
RAG & Retrieval
💡 The story addresses solving the out-of-context chunk problem for Retrieval-Augmented Generation (RAG), a core technique used in AI development tools and frameworks like LangChain or LlamaIndex.
8
Making my local LLM voice assistant faster and more scalable with RAG(johnthenerd.com)
122 ·JohnTheNerd·超过 1 年前·16 评论
RAG & Retrieval
💡 The story focuses on optimizing a local LLM voice assistant using RAG, which relies on tools like vector databases and frameworks (e.g., LlamaIndex) that are classified under the tools category.
9
Ask HN: I have many PDFs – what is the best local way to leverage AI for search?
257 ·phodo·超过 1 年前·90 评论
RAG & Retrieval
💡 The story inquires about a local-first AI solution for PDF search, which typically relies on tools like LlamaIndex, LangChain (RAG frameworks) or vector databases—all explicitly listed under the 'tools' category.
10
Systematically Improving Your RAG(jxnl.co)
176 ·jxnlco·超过 1 年前·53 评论
RAG & Retrieval
💡 The article is about systematically improving Retrieval-Augmented Generation (RAG), a technique heavily reliant on AI development tools and frameworks like LangChain or LlamaIndex, which falls under the 'tools' category in Engineering.
11
Show HN: Cognita – open-source RAG framework for modular applications(github.com)
142 ·supreetgupta·超过 1 年前·34 评论
RAG & Retrieval
💡 The story introduces Cognita, an open-source RAG framework for modular applications, which aligns with the 'tools' category covering AI development frameworks like LangChain and LlamaIndex.
12
Understanding What Matters for LLM Ingestion and Preprocessing(unstructured.io)
60 ·mooreds·超过 1 年前·1 评论
RAG & Retrieval
💡 The story focuses on LLM ingestion and preprocessing, which are core functionalities supported by AI development tools (e.g., LangChain, LlamaIndex) for data preparation in LLM workflows.
13
LlamaCloud and LlamaParse(blog.llamaindex.ai)
195 ·eferreira_·将近 2 年前·82 评论
Inference OptimizationLLM Research
💡 The story introduces LlamaCloud and LlamaParse from LlamaIndex, which are AI development tools/frameworks for building LLM applications, fitting the 'tools' category under Engineering.
📅周报
Hacker News|Powered by Doubao