AI HN来自 Hacker News 的 AI 新闻
EN
多模态 AI
今天
3天
7天
30天
全部
104
每页
1
RoboCrop: Teaching robots how to pick tomatoes(phys.org)
115 ·smurda·11 天前·69 评论
💡 The title suggests the story involves teaching robots to perform autonomous tomato-picking tasks, which aligns with the autonomous agent definition in the agents category.
2
Be Like Clippy(be-clippy.com)
356 ·Aloha·22 天前·217 评论
💡 The content focuses on a social movement advocating for transparent, user-friendly practices in AI-related tech companies, addressing data exploitation issues which fall under societal impact of AI.
3
Show HN: OCR Arena – A playground for OCR models(ocrarena.ai)
216 ·kbyatnal·大约 1 个月前·63 评论
💡 The story focuses on OCR Arena, a playground for exploring and comparing OCR model capabilities—this aligns with the Models category's scope of model comparisons and practical capability demonstrations.
4
Multimodal Diffusion Language Models for Thinking-Aware Editing and Generation(github.com)
136 ·lnyan·大约 1 个月前·12 评论
💡 The title refers to MMaDA-Parallel, a new multimodal diffusion language model for thinking-aware editing and generation, which aligns with the 'models' category covering new architectures and model releases.
5
Czech police forced to turn off facial recognition cameras at the Prague airport(edri.org)
160 ·campuscodi·大约 2 个月前·53 评论
💡 The story focuses on the shutdown of AI-powered facial recognition cameras at Prague Airport due to non-compliance with the EU AI Act (a key AI regulation) and personal data protection laws, which falls under the legal category of AI policy and compliance.
6
You can't refuse to be scanned by ICE's facial recognition app, DHS document say(404media.co)
601 ·nh43215rgb·大约 2 个月前·509 评论
💡 The story focuses on the societal impact and ethical concerns of mandatory facial recognition scans by ICE and long-term data retention, aligning with the society category's focus on AI's societal implications.
7
ICE and CBP Agents Are Scanning Peoples' Faces on Street to Verify Citizenship(old.reddit.com)
54 ·sipofwater·大约 2 个月前·2 评论
💡 The story involves government agencies using AI-powered facial recognition technology for citizenship verification in public spaces, which directly relates to the societal impact and implications of AI surveillance.
8
AI Mafia Network – An interactive visualization(dipakwani.com)
109 ·dipakwani·大约 2 个月前·9 评论
💡 The story focuses on an interactive visualization of the AI Mafia Network, which is a social discussion/exploration of connections within the AI community, fitting the 'society' category under social aspects of the ecosystem.
9
SWE-Grep and SWE-Grep-Mini: RL for Fast Multi-Turn Context Retrieval(cognition.ai)
97 ·meetpateltech·2 个月前·31 评论
💡 The story centers on the release of two new models (SWE-Grep and SWE-Grep-Mini) trained with RL for fast multi-turn context retrieval, which aligns with the 'models' category per rule 3 (model releases → models even if from a company).
10
How AI hears accents: An audible visualization of accent clusters(accent-explorer.boldvoice.com)
260 ·ilyausorov·2 个月前·129 评论
💡 The article details BoldVoice's accent identifier model, including its architecture (fine-tuned HuBERT), training process, and capability to cluster accents in latent space—aligning with the 'models' category (model details and capabilities).
11
ScribeOCR – Web interface for recognizing text, OCR, & creating digitized docs(github.com)
114 ·atomicnature·3 个月前·18 评论
💡 ScribeOCR is an AI-powered OCR tool with a web interface for text recognition and digitized document creation, fitting the 'tools' category as an AI utility tool for text processing tasks.
12
Introduction to Multi-Armed Bandits (2019)(arxiv.org)
143 ·Anon84·3 个月前·33 评论
💡 This is an academic ArXiv paper on multi-armed bandits, a fundamental machine learning framework for decision-making under uncertainty, which falls into the research category.
13
Computer Vision: Algorithms and Applications, 2nd ed(szeliski.org)
104 ·ibobev·3 个月前·21 评论
💡 The story centers on the second edition of a computer vision textbook covering algorithms and applications, which aligns with the research category's focus on algorithms and academic content.
14
Kmart's use of facial recognition to tackle refund fraud unlawful(oaic.gov.au)
245 ·_p2zi·3 个月前·263 评论
💡 The story involves a legal ruling on the unlawful use of AI-powered facial recognition technology, falling under law and policy related to AI.
15
How AI and surveillance capitalism are undermining democracy(thebulletin.org)
64 ·pseudolus·3 个月前·17 评论
💡 The story discusses the impact of AI and surveillance capitalism on democracy, which falls under social discussion and AI's societal impact as defined in the 'society' category.
16
Massive Attack turns concert into facial recognition surveillance experiment(gadgetreview.com)
343 ·loteck·3 个月前·152 评论
💡 The story involves an AI facial recognition surveillance experiment at a concert, which touches on AI ethics and societal impact (privacy concerns in public settings), aligning with the 'society' category.
17
GAO warns of privacy risks in using facial recognition in rental housing(files.gao.gov)
64 ·_p2zi·4 个月前·39 评论
💡 The story discusses a GAO report on AI-powered facial recognition use in rental housing, focusing on privacy risks and recommending HUD provide policy guidance, which falls under the legal/policy category.
18
MCP Gateway and Registry(github.com)
73 ·nikhilk218·4 个月前·53 评论
💡 The story involves IBM's MCP Gateway and Registry, and MCP (Multi-Context Processing) is explicitly listed under the agents category as part of agentic systems and tool use.
19
How can AI ID a cat?(quantamagazine.org)
187 ·sonabinu·4 个月前·69 评论
💡 The story explains the underlying algorithms and research behind how AI identifies cats, focusing on computer vision and image classification principles, which aligns with the 'research' category.
20
Home Depot sued for 'secretly' using facial recognition at self-checkouts(petapixel.com)
426 ·mikece·4 个月前·602 评论
💡 The story describes a lawsuit against Home Depot over the secret use of facial recognition technology, which falls under AI-related legal matters.
21
UK expands police facial recognition rollout with 10 new facial recognition vans(theregister.com)
156 ·rntn·4 个月前·1 评论
💡 The story covers the UK police expanding AI-powered facial recognition usage, which directly relates to AI's societal impact (surveillance, privacy concerns) and falls under the society category.
22
Facial recognition vans to be rolled out across police forces in England(news.sky.com)
428 ·amarcheschi·4 个月前·603 评论
💡 The story involves the rollout of AI-powered facial recognition vans by police forces, which directly relates to AI's societal impact and ethical considerations surrounding surveillance and privacy.
23
FastVLM: Efficient Vision Encoding for Vision Language Models(machinelearning.apple.com)
93 ·2bit·5 个月前·6 评论
💡 The story presents FastVLM, an efficient vision encoding method for vision language models, hosted on Apple's machine learning research page, aligning with the research category for AI algorithms and technical advancements.
24
Multiplatform Matrix Multiplication Kernels(burn.dev)
86 ·homarp·5 个月前·30 评论
💡 The story focuses on multiplatform matrix multiplication kernels, which are fundamental components enabling efficient AI model inference and training, aligning with the 'infra' category under Engineering.
25
NYPD bypassed facial recognition ban to ID pro-Palestinian student protester(thecity.nyc)
301 ·dataflow·5 个月前·172 评论
💡 The story involves the NYPD bypassing a facial recognition ban using Clearview AI, which falls under AI law and policy issues.
26
ICE's Supercharged Facial Recognition App of 200M Images(404media.co)
148 ·joker99·5 个月前·87 评论
💡 The story covers ICE's use of a facial recognition app, which raises AI-related societal impact and ethical concerns, fitting the society category.
27
Show HN: Cactus – Ollama for Smartphones(github.com)
231 ·HenryNdubuaku·5 个月前·82 评论
💡 The story introduces Cactus, a cross-platform framework for deploying LLMs, VLMs, Embedding Models, and TTS locally on smartphones—this aligns with the 'infra' category which focuses on deployment, inference, and edge device AI.
28
ICE Using Border Facial Recognition Tech to ID Protesters and Activists in US(techdirt.com)
61 ·lehi·6 个月前·15 评论
💡 The story covers ICE's use of facial recognition technology (an AI application) to identify protesters and activists, which directly relates to AI's societal impact and ethical considerations, aligning with the 'society' category.
29
'Improved' Grok Criticizes Democrats and Hollywood's 'Jewish Executives'(techcrunch.com)
110 ·archagon·6 个月前·28 评论
💡 The story centers on the AI model Grok producing controversial statements, which relates to AI's societal impact and ethical concerns, fitting the 'society' category.
30
Muvera: Making multi-vector retrieval as fast as single-vector search(research.google)
98 ·georgehill·6 个月前·10 评论
💡 The story introduces Muvera, a new algorithm for multi-vector retrieval from Google Research, which aligns with the research category covering technical breakthroughs and algorithms.
第 1 / 4 页,共 104 条
Hacker News|Powered by Doubao