GitHub Ranking
2026-05-06

🎬 Top 100 · Video Generation

100 repositories sorted by video generation

📦 100 repos 🕐 2026-05-06
# Repository Owner Stars Forks Language Issues Description Last Commit
1 diffusers huggingface 33.6k 7.0k Python 716 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. 2026-05-06
2 LivePortrait KlingAIResearch 18.3k 1.9k Python 275 Bring portraits to life! 2026-03-02
3 Wan2.2 Wan-Video 15.6k 1.9k Python 238 Wan: Open and Advanced Large-Scale Video Generative Models 2026-03-17
4 Duix-Avatar duixcom 12.9k 2.1k C 397 🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning. 2026-04-21
5 CogVideo zai-org 12.7k 1.3k Python 104 text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) 2025-11-04
6 HunyuanVideo Tencent-Hunyuan 12.1k 1.2k Python 158 HunyuanVideo: A Systematic Framework For Large Video Generation Model 2025-11-21
7 waoowaoo saturndec 12.0k 2.7k TypeScript 118 Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows. 2026-05-04
8 Pixelle-Video AIDC-AI 11.7k 1.8k Python 67 🚀 AI Fully Automated Short Video Engine 2026-04-13
9 Open-Generative-AI Anil-matcha 11.5k 2.0k JavaScript 5 Uncensored, open-source alternative to Higgsfield AI, Freepik AI, Krea AI, Openart AI. Free, unrestricted AI image & video generation studio with 200+ models (Flux, Midjourney, Kling, Sora, Veo). No content filters. Self-hosted, MIT licensed. 2026-05-05
10 imaginAIry brycedrennan 8.1k 474 Python 22 Pythonic AI generation of images and videos 2026-02-24
11 Toonflow-app HBAI-Ltd 7.6k 1.3k HTML 1 Toonflow is an open-source AI tool that turns stories and scripts into animated short dramas. Features AI scriptwriting, storyboarding, character and video generation. A cross-platform desktop app for efficient content creation. 2026-05-01
12 mmagic open-mmlab 7.4k 1.1k Jupyter Notebook 61 OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc. 2024-08-06
13 InfiniteTalk MeiGen-AI 6.5k 1.1k Python 156 Unlimited-length talking video generation that supports image-to-video and video-to-video generation 2025-12-18
14 Awesome-Video-Diffusion showlab 5.6k 357 N/A 0 A curated list of recent diffusion models for video generation, editing, and various other applications. 2026-04-03
15 Sana NVlabs 5.1k 346 Python 99 SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer 2026-04-14
16 autoclip zhouxiaoka 5.1k 1.1k Python 48 AutoClip: AI-powered video clipping and highlight generation · A smart highlight-extraction and editing tool for derivative video creation 2025-09-24
17 VideoCrafter AILab-CVC 5.1k 410 Python 71 VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models 2026-01-09
18 mmaction2 open-mmlab 5.0k 1.4k Python 284 OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark 2026-03-18
19 vllm-omni vllm-project 4.6k 876 Python 385 A framework for efficient model inference with omni-modality models 2026-05-06
20 echomimic_v2 antgroup 4.6k 535 Python 74 [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation 2026-02-23
21 HunyuanVideo-1.5 Tencent-Hunyuan 4.4k 225 Python 29 HunyuanVideo-1.5: A leading lightweight video generation model 2026-04-10
22 Tune-A-Video showlab 4.4k 391 Python 36 [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation 2023-10-25
23 champ fudan-generative-vision 4.3k 483 Python 49 [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance 2024-07-10
24 Text2Video-Zero Picsart-AI-Research 4.2k 386 Python 47 [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators 2023-05-06
25 Qwen2.5-Omni QwenLM 4.0k 323 Jupyter Notebook 213 Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation. 2025-06-12
26 short-video-factory YILS-LIN 3.9k 577 TypeScript 22 One-click generation of product marketing and general content short videos, AI batch automatic clipping, beautiful cross-platform desktop tool 2026-04-07
27 VACE ali-vilab 3.8k 261 Python 54 [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing 2025-10-17
28 MAGI-1 SandAI-org 3.7k 237 Python 38 MAGI-1: Autoregressive Video Generation at Scale 2025-06-17
29 lingbot-world Robbyant 3.7k 316 Python 25 Advancing Open-source World Models 2026-04-10
30 mochi genmoai 3.6k 476 Python 50 The best OSS video generation models, created by Genmo 2025-11-14
31 TurboDiffusion thu-ml 3.5k 253 Python 68 TurboDiffusion: 100–200× Acceleration for Video Diffusion Models 2026-04-15
32 OpenMontage calesthio 3.5k 689 Python 16 World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio. 2026-04-28
33 FastVideo hao-ai-lab 3.4k 328 Python 63 A unified inference and post-training framework for accelerated video generation. 2026-05-05
34 SageAttention thu-ml 3.3k 404 Cuda 162 [ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models. 2026-01-17
35 Jellyfish Forget-C 3.2k 581 Python 5 An end-to-end production workspace for AI-generated short dramas. From script input to structured storyboarding, consistency management, shot preparation, video generation, and export. 2026-04-20
36 InternGPT OpenGVLab 3.2k 235 Python 19 InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM) 2024-08-20
37 Pyramid-Flow jy0205 3.2k 299 Python 68 [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling 2024-12-21
38 Generative-Media-Skills SamurAIGPT 3.2k 349 Shell 0 Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai. 2026-05-02
39 VGen ali-vilab 3.2k 274 Python 112 Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models 2025-01-10
40 DynamiCrafter Doubiiu 3.0k 246 Python 86 [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors 2024-09-08
41 PersonaLive GVCLab 2.9k 405 Python 30 [CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming 2026-03-05
42 MultiTalk MeiGen-AI 2.9k 485 Python 150 [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation 2025-12-18
43 goku Saiyan-World 2.9k 311 Python 0 [CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/ 2025-02-19
44 MuseV TMElyralab 2.8k 304 Python 66 MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising 2024-06-28
45 open-chat-video-editor SCUTlihaoyu 2.8k 366 Python 29 Open source short video automatic generation tool 2023-06-20
46 ViMax HKUDS 2.7k 503 Python 22 "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)" 2026-03-29
47 kubric google-research 2.7k 273 Jupyter Notebook 69 A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow. 2025-05-06
48 MusePose TMElyralab 2.7k 199 Python 50 MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation 2025-03-05
49 MimicMotion Tencent 2.6k 231 Python 89 High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance 2025-11-18
50 ttt-video-dit test-time-training 2.4k 7 Python 8 Official PyTorch implementation of One-Minute Video Generation with Test-Time Training 2026-02-25
51 Stable-Video-Infinity vita-epfl 2.4k 207 Python 36 [ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling 2026-01-19
52 EasyAnimate aigc-apps 2.3k 183 Python 93 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion 2025-03-06
53 Paper2Video showlab 2.2k 321 Python 4 Automatic Video Generation from Scientific Papers 2026-03-05
54 LightX2V ModelTC 2.2k 195 Python 147 Light Image Video Generation Inference Framework 2026-05-02
55 Matrix-Game SkyworkAI 2.2k 236 Python 25 Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory 2026-03-30
56 ArcReel ArcReel 2.1k 444 Python 15 Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI. Pipeline: novel → character/scene/prop design → script → storyboard → video, with characters and scenes kept consistent across shots. 2026-05-05
57 VideoSys NUS-HPC-AI-Lab 2.0k 131 Python 22 VideoSys: An easy and efficient system for video generation 2025-08-27
58 Latte Vchitect 1.9k 192 Python 0 [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. 2025-10-30
59 ReCamMaster KlingAIResearch 1.8k 92 Python 65 [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video 2025-11-28
60 dreamtalk ali-vilab 1.8k 220 Python 44 Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models 2024-01-15
61 py-gpt szczyglis-dev 1.8k 323 Python 43 Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants, and more. Linux, Windows, Mac 2026-02-06
62 Helios PKU-YuanGroup 1.8k 134 Python 23 Helios: Real Real-Time Long Video Generation Model 2026-04-16
63 Code2Video showlab 1.7k 244 Python 0 Video generation via code 2026-05-01
64 awesome-seedance ZeroLu 1.7k 202 Shell 0 The ultimate collection of high-fidelity Seedance 2.0 prompts and Seedance AI resources. Discover how to use Seedance 2.0 for cinematic film, anime, UGC, social media, meme and advertising. Includes Seedance API guides and advanced video generation workflows. 2026-05-05
65 ControlNeXt JIA-Lab-research 1.6k 80 Python 51 Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA 2024-09-25
66 StreamingT2V Picsart-AI-Research 1.6k 159 Python 40 [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text 2025-03-27
67 VBench Vchitect 1.6k 114 Python 62 [CVPR2024 Highlight] VBench - We Evaluate Video Generation 2026-03-23
68 Awesome-World-Models leofan90 1.6k 51 N/A 0 A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related websites. 2026-05-01
69 Mora lichao-sun 1.6k 110 Python 13 Mora: More like Sora for Generalist Video Generation 2024-10-10
70 MuseBot yincongcyincong 1.6k 234 Go 3 Supports Telegram, Discord, Slack, Lark (Feishu), DingTalk, WeCom, QQ, and WeChat; compatible with various LLMs including OpenAI, Gemini, DeepSeek, Doubao, and OpenRouter. It offers intelligent conversation, image generation, video creation, and more. Works seamlessly in both private chats and group settings. 2026-04-01
71 MIMO menyifang 1.6k 72 Python 34 Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling" 2025-06-19
72 HunyuanWorld-Voyager Tencent-Hunyuan 1.5k 164 Python 27 Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction. 2026-04-15
73 Phantom Phantom-video 1.5k 97 Python 39 Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment 2025-09-11
74 MiniMax-MCP MiniMax-AI 1.5k 257 Python 20 Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs. 2026-04-15
75 video-diffusion-pytorch lucidrains 1.4k 140 Python 27 Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch 2024-05-03
76 FollowYourPose mayuelala 1.4k 95 Python 12 [AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos" 2024-03-20
77 MagicTime PKU-YuanGroup 1.3k 123 Python 9 [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators 2026-04-14
78 GEN3C nv-tlabs 1.3k 73 Jupyter Notebook 31 [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control 2025-09-24
79 TeaCache ali-vilab 1.3k 56 Python 29 Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model 2025-06-08
80 minisora mini-sora 1.3k 147 Python 4 MiniSora: A community aims to explore the implementation path and future development direction of Sora. 2025-02-18
81 articulated-animation snap-research 1.3k 351 Jupyter Notebook 54 Code for Motion Representations for Articulated Animation paper 2025-06-01
82 gemini-business2api yukkcat 1.3k 953 Python 34 OpenAI-compatible API for Gemini Business with multi-account load balancing and multimodal capabilities (image/video generation, file parsing) 2026-04-30
83 StableAvatar Francis-Rings 1.2k 108 Python 1 We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a reference image and audio. 2026-01-20
84 Tora alibaba 1.2k 59 Python 11 [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation 2026-04-14
85 UForm unum-cloud 1.2k 78 Python 15 Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️ 2025-10-30
86 HuMo Phantom-video 1.2k 234 Python 16 HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning 2026-01-25
87 HunyuanCustom Tencent-Hunyuan 1.2k 108 Python 35 HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation 2025-10-15
88 LongLive NVlabs 1.2k 112 Python 9 [ICLR 2026] LongLive: Real-time Interactive Long Video Generation 2026-02-26
89 UniAnimate ali-vilab 1.2k 61 Python 63 Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation". 2025-04-15
90 MagicDrive cure-lab 1.2k 49 Python 8 [ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control” 2025-04-21
91 Show-1 showlab 1.1k 59 Python 9 [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation 2025-09-13
92 cosmos-predict2.5 nvidia-cosmos 1.1k 152 Python 20 Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video. 2026-05-04
93 wunjo.wladradchenko.ru wladradchenko 1.1k 121 JavaScript 0 Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free. 2026-02-03
94 short-video-maker gyoridavid 1.1k 370 TypeScript 21 Creates short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API. 2025-06-21
95 ShareGPT4Video ShareGPT4Omni 1.1k 45 Python 24 [NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions" 2024-10-09
96 memo memoavatar 1.1k 105 Python 20 [TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation 2025-08-06
97 MotionDirector showlab 1.0k 61 Python 25 [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. 2024-08-21
98 Motus thu-ml 1.0k 52 Python 25 Official code of Motus: A Unified Latent Action World Model 2026-01-05
99 CVPR2022-DaGAN harlanhong 1.0k 128 Python 33 Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation 2023-12-04
100 magvit google-research 998 47 Python 21 Official JAX implementation of MAGVIT: Masked Generative Video Transformer 2024-01-17