🎬 Top 100 · Video Generation前 100 · 视频生成

100 repositories sorted by video generation 按视频生成排序，共 100 个仓库

⌕

📦 100 repos个仓库 🕐 2026-05-06

#	Repository仓库	Stars	Forks	Language语言	Issues	Description描述	Last Commit最后提交
1	diffusers huggingface	33.6k	7.0k	Python	716	🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.🤗 扩散器：PyTorch 中用于生成图像、视频和音频的最先进的扩散模型。	2026-05-06
2	LivePortrait KlingAIResearch	18.3k	1.9k	Python	275	Bring portraits to life!让肖像栩栩如生！	2026-03-02
3	Wan2.2 Wan-Video	15.6k	1.9k	Python	238	Wan: Open and Advanced Large-Scale Video Generative ModelsWan：开放且先进的大规模视频生成模型	2026-03-17
4	Duix-Avatar duixcom	12.9k	2.1k	C	397	🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.🚀 真正开源的人工智能头像（数字人）工具包，用于离线视频生成和数字人克隆。	2026-04-21
5	CogVideo zai-org	12.7k	1.3k	Python	104	text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)文本和图像到视频生成：CogVideoX (2024) 和 CogVideo (ICLR 2023)	2025-11-04
6	HunyuanVideo Tencent-Hunyuan	12.1k	1.2k	Python	158	HunyuanVideo: A Systematic Framework For Large Video Generation Model混源视频：大视频生成模型的系统框架	2025-11-21
7	waoowaoo saturndec	12.0k	2.7k	TypeScript	118	首家工业级全流程 AI 影视生产平台。Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows.首家工业级全流程 AI 影视生产平台。 Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows.	2026-05-04
8	Pixelle-Video AIDC-AI	11.7k	1.8k	Python	67	🚀 AI 全自动短视频引擎 \| AI Fully Automated Short Video Engine	2026-04-13
9	Open-Generative-AI Anil-matcha	11.5k	2.0k	JavaScript	5	Uncensored, open-source alternative to Higgsfield AI, Freepik AI, Krea AI, Openart AI — Free, unrestricted AI image & video generation studio with 200+ models (Flux, Midjourney, Kling, Sora, Veo). No content filters. Self-hosted, MIT licensed.Higgsfield AI、Freepik AI、Krea AI、Openart AI 的未经审查、开源替代方案 — 免费、不受限制的 AI 图像和视频生成工作室，拥有 200 多个模型（Flux、Midjourney、Kling、Sora、Veo）。没有内容过滤器。自托管，麻省理工学院许可。	2026-05-05
10	imaginAIry brycedrennan	8.1k	474	Python	22	Pythonic AI generation of images and videosPythonic AI 生成图像和视频	2026-02-24
11	Toonflow-app HBAI-Ltd	7.6k	1.3k	HTML	1	Toonflow 是开源一站式 AI 短剧创作工具，将小说、剧本快速转化为动画短剧。集成 AI 编剧、智能分镜、角色与视频生成，跨平台桌面端轻量部署，助力创作者低成本批量产出视觉内容。Toonflow is an open-source AI tool that turns stories and scripts into animated short dramas. Features AI scriptwriting, storyboarding, character and video generation. A cross-platform desktop app for efficient content creation.Toonflow 是开源一站式 AI 短剧创作工具，将小说、剧本快速转化为动画短剧。集成 AI 编剧、智能分镜、角色与视频生成，跨平台桌面端轻量部署，助力创作者低成本批量产出视觉内容。 Toonflow is an open-source AI tool that turns stories and scripts into animated short dramas. Features AI scriptwriting, storyboarding, character and video generation. A cross-platform desktop app for efficient content creation.	2026-05-01
12	mmagic open-mmlab	7.4k	1.1k	Jupyter Notebook	61	OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.OpenMMLab 多模式高级、生成和智能创建工具箱。解锁魔法🪄：生成式人工智能 (AIGC)、易于使用的 API、出色的模型动物园、扩散模型，用于文本到图像生成、图像/视频恢复/增强等。	2024-08-06
13	InfiniteTalk MeiGen-AI	6.5k	1.1k	Python	156	Unlimited-length talking video generation that supports image-to-video and video-to-video generation无限长度的对话视频生成，支持图像到视频和视频到视频的生成	2025-12-18
14	Awesome-Video-Diffusion showlab	5.6k	357	N/A	0	A curated list of recent diffusion models for video generation, editing, and various other applications.用于视频生成、编辑和各种其他应用的最新扩散模型的精选列表。	2026-04-03
15	Sana NVlabs	5.1k	346	Python	99	SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion TransformerSANA：使用线性扩散变压器进行高效高分辨率图像合成	2026-04-14
16	autoclip zhouxiaoka	5.1k	1.1k	Python	48	AutoClip : AI-powered video clipping and highlight generation · 一款智能高光提取与剪辑的二创工具	2025-09-24
17	VideoCrafter AILab-CVC	5.1k	410	Python	71	VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion ModelsVideoCrafter2：克服高质量视频扩散模型的数据限制	2026-01-09
18	mmaction2 open-mmlab	5.0k	1.4k	Python	284	OpenMMLab's Next Generation Video Understanding Toolbox and BenchmarkOpenMMLab 的下一代视频理解工具箱和基准	2026-03-18
19	vllm-omni vllm-project	4.6k	876	Python	385	A framework for efficient model inference with omni-modality models全模态模型的高效模型推理框架	2026-05-06
20	echomimic_v2 antgroup	4.6k	535	Python	74	[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation[CVPR 2025] EchoMimicV2：迈向引人注目、简化和半身人体动画	2026-02-23
21	HunyuanVideo-1.5 Tencent-Hunyuan	4.4k	225	Python	29	HunyuanVideo-1.5: A leading lightweight video generation modelHunyuanVideo-1.5：领先的轻量级视频生成模型	2026-04-10
22	Tune-A-Video showlab	4.4k	391	Python	36	[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation[ICCV 2023] Tune-A-Video：用于文本到视频生成的图像扩散模型的一次性调整	2023-10-25
23	champ fudan-generative-vision	4.3k	483	Python	49	[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance[ECCV 2024] Champ：具有 3D 参数化指导的可控且一致的人体图像动画	2024-07-10
24	Text2Video-Zero Picsart-AI-Research	4.2k	386	Python	47	[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators[ICCV 2023 Oral] 文本到图像扩散模型是零样本视频生成器	2023-05-06
25	Qwen2.5-Omni QwenLM	4.0k	323	Jupyter Notebook	213	Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.Error 500 (Server Error)!!1500.That’s an error.There was an error. Please try again later.That’s all we know.	2025-06-12
26	short-video-factory YILS-LIN	3.9k	577	TypeScript	22	一键生成产品营销与泛内容短视频，AI批量自动剪辑，高颜值跨平台桌面端工具 One click generation of product marketing and general content short videos, AI batch automatic cliping, beautiful cross platform desktop tool	2026-04-07
27	VACE ali-vilab	3.8k	261	Python	54	[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing[ICCV 2025] 论文的官方实现：VACE: All-in-One Video Creation and Editing	2025-10-17
28	MAGI-1 SandAI-org	3.7k	237	Python	38	MAGI-1: Autoregressive Video Generation at ScaleMAGI-1：大规模自回归视频生成	2025-06-17
29	lingbot-world Robbyant	3.7k	316	Python	25	Advancing Open-source World Models推进开源世界模式	2026-04-10
30	mochi genmoai	3.6k	476	Python	50	The best OSS video generation models, created by Genmo最好的 OSS 视频生成模型，由 Genmo 创建	2025-11-14
31	TurboDiffusion thu-ml	3.5k	253	Python	68	TurboDiffusion: 100–200× Acceleration for Video Diffusion ModelsTurboDiffusion：视频扩散模型的 100–200× 加速	2026-04-15
32	OpenMontage calesthio	3.5k	689	Python	16	World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.世界上第一个开源的代理视频制作系统。 12条管道、52种工具、500+座席技能。将您的人工智能编码助手变成一个完整的视频制作工作室。	2026-04-28
33	FastVideo hao-ai-lab	3.4k	328	Python	63	A unified inference and post-training framework for accelerated video generation.用于加速视频生成的统一推理和后训练框架。	2026-05-05
34	SageAttention thu-ml	3.3k	404	Cuda	162	[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.[ICLR2025、ICML2025、NeurIPS2025 Spotlight] 与 FlashAttention 相比，量化注意力实现了 2-5 倍的加速，并且不会丢失跨语言、图像和视频模型的端到端指标。	2026-01-17
35	Jellyfish Forget-C	3.2k	581	Python	5	An end-to-end production workspace for AI-generated short dramas. From script input to structured storyboarding, consistency management, shot preparation, video generation, and export.用于人工智能生成的短剧的端到端制作工作区。从脚本输入到结构化故事板、一致性管理、镜头准备、视频生成和导出。	2026-04-20
36	InternGPT OpenGVLab	3.2k	235	Python	19	InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models.现在它支持DragGAN、ChatGPT、ImageBind、多模式聊天（如GPT-4、SAM）、交互式图像编辑等。请在igpt.opengvlab.com上尝试（支持DragGAN、ChatGPT、ImageBind、SAM的在线演示系统）	2024-08-20
37	Pyramid-Flow jy0205	3.2k	299	Python	68	[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling[ICLR 2025] 用于高效视频生成建模的金字塔流匹配	2024-12-21
38	Generative-Media-Skills SamurAIGPT	3.2k	349	Shell	0	Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.AI 代理的多模式生成媒体技能（Claude Code、Cursor、Gemini CLI）。由 muapi.ai 提供支持的高质量图像、视频和音频生成。	2026-05-02
39	VGen ali-vilab	3.2k	274	Python	112	Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion modelsVGen 的官方存储库：基于扩散模型的视频生成的整体视频生成生态系统	2025-01-10
40	DynamiCrafter Doubiiu	3.0k	246	Python	86	[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors[ECCV 2024，口头] DynamiCrafter：使用视频扩散先验对开放域图像进行动画处理	2024-09-08
41	PersonaLive GVCLab	2.9k	405	Python	30	[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming[CVPR 2026] PersonaLive！：用于直播的富有表现力的肖像图像动画	2026-03-05
42	MultiTalk MeiGen-AI	2.9k	485	Python	150	[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation[NeurIPS 2025] 让他们说话：音频驱动的多人对话视频生成	2025-12-18
43	goku Saiyan-World	2.9k	311	Python	0	[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/【CVPR2025亮点】视频生成基础模型：https://saiyan-world.github.io/goku/	2025-02-19
44	MuseV TMElyralab	2.8k	304	Python	66	MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel DenoisingMuseV：具有视觉条件并行降噪功能的无限长度和高保真虚拟人类视频生成	2024-06-28
45	open-chat-video-editor SCUTlihaoyu	2.8k	366	Python	29	Open source short video automatic generation tool开源短视频自动生成工具	2023-06-20
46	ViMax HKUDS	2.7k	503	Python	22	"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"“ViMax：代理视频生成（导演、编剧、制片人和视频生成器一体化）”	2026-03-29
47	kubric google-research	2.7k	273	Jupyter Notebook	69	A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.用于创建具有丰富注释（例如实例分割掩模、深度图和光流）的半真实合成多对象视频的数据生成管道。	2025-05-06
48	MusePose TMElyralab	2.7k	199	Python	50	MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human GenerationMusePose：用于生成虚拟人的姿势驱动的图像到视频框架	2025-03-05
49	MimicMotion Tencent	2.6k	231	Python	89	High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance具有置信度感知姿势指导的高质量人体运动视频生成	2025-11-18
50	ttt-video-dit test-time-training	2.4k	7	Python	8	Official PyTorch implementation of One-Minute Video Generation with Test-Time Training通过测试时训练生成一分钟视频的官方 PyTorch 实现	2026-02-25
51	Stable-Video-Infinity vita-epfl	2.4k	207	Python	36	[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling【ICLR 26 Oral】稳定视频无限：带错误回收的无限长度视频生成	2026-01-19
52	EasyAnimate aigc-apps	2.3k	183	Python	93	📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion📺 基于 Transformer Diffusion 的高分辨率长视频生成端到端解决方案	2025-03-06
53	Paper2Video showlab	2.2k	321	Python	4	Automatic Video Generation from Scientific Papers从科学论文自动生成视频	2026-03-05
54	LightX2V ModelTC	2.2k	195	Python	147	Light Image Video Generation Inference Framework光图像视频生成推理框架	2026-05-02
55	Matrix-Game SkyworkAI	2.2k	236	Python	25	Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon MemoryMatrix-Game 3.0：具有长视野记忆的实时流式交互世界模型	2026-03-30
56	ArcReel ArcReel	2.1k	444	Python	15	AI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频，跨镜头角色与场景一致 \| Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI	2026-05-05
57	VideoSys NUS-HPC-AI-Lab	2.0k	131	Python	22	VideoSys: An easy and efficient system for video generationVideoSys：简单高效的视频生成系统	2025-08-27
58	Latte Vchitect	1.9k	192	Python	0	[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. [TMLR 2025] Latte：用于视频生成的潜在扩散变压器。	2025-10-30
59	ReCamMaster KlingAIResearch	1.8k	92	Python	65	[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video[ICCV'25 最佳论文入围] ReCamMaster：来自单个视频的摄像机控制生成渲染	2025-11-28
60	dreamtalk ali-vilab	1.8k	220	Python	44	Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models论文的官方实现：DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models	2024-01-15
61	py-gpt szczyglis-dev	1.8k	323	Python	43	Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants,and more. Linux, Windows, Mac桌面 AI 助手由 GPT-5、GPT-4、o1、o3、Gemini、Claude、Ollama、DeepSeek、Perplexity、Grok、Bielik、聊天、视觉、语音、RAG、图像和视频生成、代理、工具、MCP、插件、语音合成和识别、网络搜索、内存、预设、助手等提供支持。 Linux、Windows、Mac	2026-02-06
62	Helios PKU-YuanGroup	1.8k	134	Python	23	Helios: Real Real-Time Long Video Generation ModelHelios：实时长视频生成模型	2026-04-16
63	Code2Video showlab	1.7k	244	Python	0	Video generation via code通过代码生成视频	2026-05-01
64	awesome-seedance ZeroLu	1.7k	202	Shell	0	The ultimate collection of high-fidelity Seedance 2.0 prompts and Seedance AI resources. Discover Seedance 2.0 how to use for cinematic film, anime, UGC, social media, meme and advertising. Includes Seedance API guides and advanced video generation workflows.高保真 Seedance 2.0 提示和 Seedance AI 资源的终极集合。了解 Seedance 2.0 如何用于电影、动漫、UGC、社交媒体、模因和广告。包括 Seedance API 指南和高级视频生成工作流程。	2026-05-05
65	ControlNeXt JIA-Lab-research	1.6k	80	Python	51	Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA可控视频和图像生成、SVD、Animate Anybody、ControlNet、ControlNeXt、LoRA	2024-09-25
66	StreamingT2V Picsart-AI-Research	1.6k	159	Python	40	[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text[CVPR 2025] StreamingT2V：从文本生成一致、动态且可扩展的长视频	2025-03-27
67	VBench Vchitect	1.6k	114	Python	62	[CVPR2024 Highlight] VBench - We Evaluate Video Generation【CVPR2024亮点】VBench - 我们评估视频生成	2026-03-23
68	Awesome-World-Models leofan90	1.6k	51	N/A	0	A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related websites.有关世界模型定义以及使用世界模型进行通用视频生成、嵌入式人工智能和自动驾驶的论文的完整列表，包括论文、代码和相关网站。	2026-05-01
69	Mora lichao-sun	1.6k	110	Python	13	Mora: More like Sora for Generalist Video GenerationMora：更像 Sora，用于通用视频生成	2024-10-10
70	MuseBot yincongcyincong	1.6k	234	Go	3	supports Telegram, Discord, Slack, Lark（飞书），钉钉, 企业微信, QQ, 微信, compatible with various LLMs including OpenAI, Gemini, DeepSeek, Doubao, and OpenRouter. It offers intelligent conversation, image generation, video creation, and more. Works seamlessly in both private chats and group settings.支持Telegram、Discord、Slack、Lark（飞书）、钉钉、企业微信、QQ、微信，兼容OpenAI、Gemini、DeepSeek、豆宝、OpenRouter等多种LLM。它提供智能对话、图像生成、视频创建等功能。在私人聊天和群组设置中无缝工作。	2026-04-01
71	MIMO menyifang	1.6k	72	Python	34	Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"《MIMO：空间分解建模的可控字符视频合成》正式实现	2025-06-19
72	HunyuanWorld-Voyager Tencent-Hunyuan	1.5k	164	Python	27	Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.Voyager 是一种以摄像机输入为条件的交互式 RGBD 视频生成模型，支持实时 3D 重建。	2026-04-15
73	Phantom Phantom-video	1.5k	97	Python	39	Phantom: Subject-Consistent Video Generation via Cross-Modal AlignmentPhantom：通过跨模态对齐生成主题一致的视频	2025-09-11
74	MiniMax-MCP MiniMax-AI	1.5k	257	Python	20	Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.官方 MiniMax 模型上下文协议 (MCP) 服务器，支持与强大的文本转语音、图像生成和视频生成 API 进行交互。	2026-04-15
75	video-diffusion-pytorch lucidrains	1.4k	140	Python	27	Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch视频扩散模型的实现，Jonathan Ho 的新论文将 DDPM 扩展到视频生成 - 在 Pytorch 中	2024-05-03
76	FollowYourPose mayuelala	1.4k	95	Python	12	[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos" [AAAI 2024] Follow-Your-Pose：此存储库是“Follow-Your-Pose：使用无姿势视频的姿势引导文本到视频生成”的官方实现	2024-03-20
77	MagicTime PKU-YuanGroup	1.3k	123	Python	9	[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators[TPAMI 2025🔥] MagicTime：延时视频生成模型作为变形模拟器	2026-04-14
78	GEN3C nv-tlabs	1.3k	73	Jupyter Notebook	31	[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control[CVPR 2025 亮点]GEN3C：通过精确的摄像机控制生成 3D 信息的世界一致视频	2025-09-24
79	TeaCache ali-vilab	1.3k	56	Python	29	Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model时间步嵌入告诉我们：是时候缓存视频扩散模型了	2025-06-08
80	minisora mini-sora	1.3k	147	Python	4	MiniSora: A community aims to explore the implementation path and future development direction of Sora.MiniSora：一个旨在探索Sora的实现路径和未来发展方向的社区。	2025-02-18
81	articulated-animation snap-research	1.3k	351	Jupyter Notebook	54	Code for Motion Representations for Articulated Animation paper铰接动画运动表示代码论文	2025-06-01
82	gemini-business2api yukkcat	1.3k	953	Python	34	OpenAI-compatible API for Gemini Business with multi-account load balancing and multimodal capabilities (image/video generation, file parsing) \| 将 Gemini Business 转为 OpenAI 兼容接口，支持多账户负载均衡及多模态能力（图像生成、视频生成、解析文件）	2026-04-30
83	StableAvatar Francis-Rings	1.2k	108	Python	1	We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a reference image and audio.我们推出了 StableAvatar，这是第一个端到端视频扩散转换器，它以参考图像和音频为条件，合成无限长度的高质量音频驱动的头像视频，无需任何后处理。	2026-01-20
84	Tora alibaba	1.2k	59	Python	11	[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation[CVPR'25]Tora：用于视频生成的轨迹导向扩散变压器	2026-04-14
85	UForm unum-cloud	1.2k	78	Python	15	Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️袖珍多模态 AI，用于跨多语言文本、图像和🔜视频进行内容理解和生成，速度比 OpenAI CLIP 和 LLaVA 🖼️ & 🖋️ 快 5 倍	2025-10-30
86	HuMo Phantom-video	1.2k	234	Python	16	HuMo: Human-Centric Video Generation via Collaborative Multi-Modal ConditioningHuMo：通过协作多模态调节以人为中心的视频生成	2026-01-25
87	HunyuanCustom Tencent-Hunyuan	1.2k	108	Python	35	HunyuanCustom: A Multimodal-Driven Architecture for Customized Video GenerationHunyuanCustom：用于定制视频生成的多模态驱动架构	2025-10-15
88	LongLive NVlabs	1.2k	112	Python	9	[ICLR 2026] LongLive: Real-time Interactive Long Video Generation[ICLR 2026] LongLive：实时交互式长视频生成	2026-02-26
89	UniAnimate ali-vilab	1.2k	61	Python	63	Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diﬀusion Models for Consistent Human Image Animation".SCIS-2025 论文“UniAnimate：驯服统一视频扩散模型以实现一致的人类图像动画”的代码。	2025-04-15
90	MagicDrive cure-lab	1.2k	49	Python	8	[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”[ICLR24] 论文《MagicDrive: Street View Generation with Diverse 3D Geometry Control》正式实现	2025-04-21
91	Show-1 showlab	1.1k	59	Python	9	[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation[IJCV] Show-1：将像素和潜在扩散模型结合起来生成文本到视频	2025-09-13
92	cosmos-predict2.5 nvidia-cosmos	1.1k	152	Python	20	Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.Cosmos-Predict2.5是Cosmos世界基础模型（WFM）系列的最新版本，专门用于以视频形式模拟和预测世界的未来状态。	2026-05-04
93	wunjo.wladradchenko.ru wladradchenko	1.1k	121	JavaScript	0	Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.Wunjo CE：面部交换、唇形同步、控制删除对象、文本和背景、重新造型、音频分离器、克隆语音、视频生成。开源、本地且免费。	2026-02-03
94	short-video-maker gyoridavid	1.1k	370	TypeScript	21	Creates short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API.使用模型上下文协议 (MCP) 和 REST API 为 TikTok、Instagram Reels 和 YouTube Shorts 创建短视频。	2025-06-21
95	ShareGPT4Video ShareGPT4Omni	1.1k	45	Python	24	[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"[NeurIPS 2024]“ShareGPT4Video：通过更好的字幕提高视频理解和生成”的正式实现	2024-10-09
96	memo memoavatar	1.1k	105	Python	20	[TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation[TMLR] 用于生成富有表现力的谈话视频的记忆引导扩散	2025-08-06
97	MotionDirector showlab	1.0k	61	Python	25	[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.[ECCV 2024 Oral] MotionDirector：文本到视频扩散模型的运动定制。	2024-08-21
98	Motus thu-ml	1.0k	52	Python	25	Official code of Motus: A Unified Latent Action World ModelMotus官方代码：统一的潜在行动世界模型	2026-01-05
99	CVPR2022-DaGAN harlanhong	1.0k	128	Python	33	Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video GenerationCVPR2022论文官方代码：Depth-Aware Generative Adversarial Network for Talking Head Video Generation	2023-12-04
100	magvit google-research	998	47	Python	21	Official JAX implementation of MAGVIT: Masked Generative Video TransformerMAGVIT 的官方 JAX 实现：Masked Generative Video Transformer	2024-01-17

No repositories match your search 没有匹配的仓库