AI HN来自 Hacker News 的 AI 新闻
EN
今天
3天
7天
30天
全部
14 · "video generation"
每页
1
Ovi: Twin backbone cross-modal fusion for audio-video generation(github.com)
314 ·montyanderson·2 个月前·114 评论
该内容介绍了Ovi,这是一个由character-ai在GitHub上发布的、采用双骨干跨模态融合技术的音视频生成模型。
2
Paper2video: Automatic video generation from scientific papers(arxiv.org)
91 ·jinqueeny·3 个月前·24 评论
ArXiv论文《Paper2Video》介绍了首个包含101篇研究论文及其作者创建的演示视频、幻灯片和元数据的基准数据集,并提出四个定制化评估指标。论文还提出多智能体框架PaperTalker,整合幻灯片生成、光标定位、字幕合成、语音合成和虚拟发言人渲染等功能,生成的学术演示视频在信息忠实度和丰富度上优于现有基线模型。
3
Voyager – An interactive video generation model with realtime 3D reconstruction(github.com)
322 ·mingtianzhang·4 个月前·225 评论
该内容介绍了腾讯混元开发的交互式视频生成模型Voyager,该模型支持实时3D重建。其GitHub仓库可能包含该模型的技术细节和使用指南。
4
Gemini Diffusion(simonwillison.net)
890 ·mdp2021·7 个月前·244 评论
Google AI
💡 The title 'Gemini Diffusion' combines Google's Gemini model with diffusion techniques, which are core to generative media (e.g., image/video generation), aligning with the media category.
5
AniSora: Open-source anime video generation model(komiko.app)
356 ·PaulineGar·7 个月前·218 评论
Image Generation
💡 The story focuses on AniSora, an open-source anime video generation model, which falls under generative media (video) as defined in the Applications category.
6
LTXVideo 13B AI video generation(ltxv.video)
216 ·zoudong376·8 个月前·64 评论
Code & Development
💡 The story is about LTXVideo 13B, an AI video generation model, which falls under the generative media category focusing on video content.
7
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation(lllyasviel.github.io)
270 ·GaggiX·8 个月前·27 评论
AI ChipsCode & Development
💡 The story discusses a technique for next-frame prediction models used in video generation, which falls under the generative media category.
8
Tom and Jerry One-Minute Video Generation with Test-Time Training(test-time-training.github.io)
80 ·walterbell·9 个月前·18 评论
💡 The story is about generating a Tom and Jerry one-minute video using test-time training, which falls under generative video, a subcategory of the media category.
9
Veo 2: Our video generation model(deepmind.google)
587 ·mvoodarla·大约 1 年前·327 评论
Google AIVideo Generation
💡 The story focuses on Veo 2, a video generation model, which falls under the generative media (video) subcategory of Applications.
10
Apple Smells Blood in the Water(petapixel.com)
89 ·atombender·大约 1 年前·92 评论
Apple AI
💡 The story originates from Petapixel, a media tech outlet focused on visual content, implying it likely covers Apple's AI-powered generative media feature (e.g., image/video generation/editing), which aligns with the media category.
11
Open-Sora does pretty good video generation on consumer GPUs(backprop.co)
109 ·kristoo·超过 1 年前·129 评论
OpenAI EcosystemVideo Generation
💡 The story focuses on Open-Sora, a generative video model, which aligns with the media category covering generative media (video generation).
12
Highly realistic talking head video generation(github.com)
136 ·HuiLi1998·超过 1 年前·60 评论
💡 The story focuses on highly realistic talking head video generation, which falls under the generative media category covering video content creation.
13
StoryDiffusion: Long-range image and video generation(storydiffusion.github.io)
233 ·doodlesdev·超过 1 年前·65 评论
Image Generation
💡 StoryDiffusion is a project for long-range image and video generation, which falls under the generative media category.
14
VideoGigaGAN: Towards Detail-Rich Video Super-Resolution(videogigagan.github.io)
188 ·bookofjoe·超过 1 年前·3 评论
Code & Development
💡 The story focuses on VideoGigaGAN, a generative AI model designed for detail-rich video super-resolution, which falls under the generative media category (specifically video generation/enhancement).
📅周报
Hacker News|Powered by Doubao