AI Weekly Report (2024-03-11 ~ 2024-03-18)
Weekly Overview
This week saw a surge in agentic systems (e.g., Devin, an AI software engineer, and SIMA, an agent for 3D environments) and broader access to local AI inference (Ollama added AMD GPU support). Regulation took center stage as the U.S. House passed a bill that could ban TikTok and the European Parliament adopted the landmark EU AI Act, while data ownership tensions flared (Midjourney banned Stability AI employees over alleged scraping). Foundational research advanced in matrix multiplication efficiency, diffusion model theory, and LLM reasoning methods such as Quiet-STaR. Key trends included autonomous AI agents for coding and virtual worlds, new developer tooling, and growing scrutiny of AI’s societal impact (e.g., academics being priced out of research).
Key Developments
LLM / Large Language Models
- Quiet-STaR: Language models that teach themselves to "think before speaking" to improve reasoning.
- IBM & NASA: Jointly built LMs to make scientific knowledge more accessible to researchers and the public.
- Gemma Fixes: Community-driven bug fixes for Google’s Gemma open-source model.
- SuperPrompt: A 77M-parameter model to generate high-quality text-to-image prompts.
- User Preferences: Active Hacker News debate over whether GPT-4 Turbo or Claude 3 Opus performs better on real-world tasks.
Generative AI / Tools
- OpenAI Transformer Debugger: Released to help developers inspect and debug transformer models.
- Spreadsheets are all you need: A viral project (1493 pts) that implements GPT-2 inside a spreadsheet to show how transformers work.
- TextSnatcher: AI-powered OCR tool for Linux to copy text from images.
- AutoDev: Microsoft’s AI-driven automated development system (arXiv paper).
- Sora: OpenAI’s generative video model was discussed for its creative potential and technical challenges.
AI Company News
- Super Micro: Market cap hit $60B, driven by AI compute infrastructure demand.
- Apple: Acquired DarwinAI ahead of iOS 18’s generative AI updates.
- Midjourney: Banned all Stability AI employees over alleged data scraping.
- Physical Intelligence: Backed by OpenAI, building AI "brains" for robots.
- Fig: Announced sunsetting (shutdown) of its AI-powered terminal tool.
- Meticulate: YC W24 startup launching LLM pipelines for business research.
Infrastructure / Research
- Meta GenAI Infra: Detailed insights into building scalable infrastructure for generative AI.
- Ollama AMD Support: Local AI inference tool now works with AMD GPUs, expanding accessibility.
- Flash Attention: Minimal CUDA implementation (~100 lines) of the popular transformer optimization.
- Matrix Multiplication: Algorithm that cuts multiplications in half for certain matrix operations.
- Diffusion Models: New theoretical perspective to simplify diffusion model training and inference.
Other Notable News
- Regulatory:
  - EU AI Act: Adopted by MEPs, classifying AI systems into risk categories (e.g., high-risk for healthcare/finance).
  - TikTok Ban: U.S. House passed a bill forcing ByteDance to sell TikTok or face a ban.
  - Reddit FTC Probe: FTC inquiry into Reddit’s sale of user data for AI training.
- Agents:
  - Devin: Cognition’s autonomous AI software engineer that can write code, debug, and deploy projects.
  - SIMA: Google DeepMind’s generalist AI agent for 3D virtual environments/games.
  - Skyvern: Open-source browser automation using LLMs + computer vision.
- Safety:
  - LLM Theft: Research showing how to steal parts of production language models.
  - Jailbreaking: ASCII art used to elicit harmful responses from five major chatbots.
  - DARPA: Launched an initiative to defend against AI-manipulated media.
Projects to Watch
- Devin: Autonomous AI software engineer (530 pts) that can handle end-to-end coding tasks.
- SIMA: Google DeepMind’s generalist agent for 3D virtual environments/games (559 pts).
- Ollama AMD Support: Makes local AI inference accessible to AMD GPU users (633 pts).
- Spreadsheets are all you need: Viral project that implements GPT-2 inside a spreadsheet (1493 pts).
- Skyvern: Open-source browser automation with LLMs + CV (422 pts).
- LLM4Decompile: Decompiles binary code into source using LLMs (412 pts).
- OpenAI Transformer Debugger: Essential tool for transformer model developers (362 pts).
Trending Keywords This Week
- Agentic AI
- Local AI Inference
- Regulatory Scrutiny
- Foundational AI Research
- Data Ownership & Ethics
- AI-driven Development
- Multimodal LLM
- Generative Video (Sora)
- AI for Scientific Research
- Accessibility in AI (AMD Support)
- Societal Impact of AI
- Autonomous Coding Agents
- AI Infrastructure Demand
- Open-source AI Tools
- AI Safety & Security
- Generative AI for Media
- AI in Healthcare (Prostate Cancer Research)
- AI in Robotics (Physical Intelligence)
- AI in Education (Coloring Pages Tool)
- AI in Software Engineering (AutoDev, Devin)
- AI in Gaming (SIMA for 3D Games)
- AI in Legal (EU AI Act, TikTok Ban)
- AI in Ethics (Data Scraping Tensions)
- AI in Research (Quiet-STaR, Diffusion Models)
- AI in Productivity (Spreadsheets AI)
- AI in Accessibility (TextSnatcher)
- AI in Infrastructure (Super Micro, Meta GenAI)
- AI in Open-source (Ollama, Skyvern)
- AI in Hardware (Intel Gaudi2 vs Nvidia H100)
- AI in Media (Sora, Resolume)
- AI in Models (Gemma Fixes, SuperPrompt)
- AI in Safety (LLM Theft, Jailbreaking)
- AI in Society (Academics Priced Out of Research)
- AI in Agents (Devin, SIMA)
- AI in Coding (LLM4Decompile)