ACGC 项目资源集

AI 发展日新月异, 各种项目/应用/工具/资源… 令人兴奋, 而且很多都是免费使用, 或者开源.

内容主要分为 项目的官方网站, 项目的 GitHub 主页, 项目的 Hugging Face Space (可在线试用), AIGC 相关网站, 相关文档资料. 它们之间可能会有重叠的项目, 但非常少, 我做了去重, 确保唯一性.

以下项目是截至 2024 年 12 月 9 日搜集整理并分类:

项目的官方网站

项目的 GitHub 主页

Hugging Face Space

AIGC 相关网站

相关文档资料

等待中的项目

后面新添加, 都会标注日期:

项目官方网站

Lovable 2025-01-02

Monica - ChatGPT AI Assistant | GPT-4o, Claude 3.5, Gemini 1.5 2025-01-02

Voicenotes: Transcribe notes, meetings & ask AI 2025-01-02

YouMind - AI Creation System 2025-01-02

GitHub 项目

IDEA-Research/GroundingDINO: [ECCV 2024] Official implementation of the paper “Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection” 2025-01-02

Yuan-ManX/ai-game-devtools: Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥 2025-01-02

yandex-research/switti: The code and models for the paper: Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis 2025-01-02

FreedomIntelligence/HuatuoGPT-o1: Medical o1, Towards medical complex reasoning with LLMs 2025-01-02

vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs 2025-01-02

sgl-project/sglang: SGLang is a fast serving framework for large language models and vision language models. 2025-01-02

modelscope/DiffSynth-Studio: Enjoy the magic of Diffusion models! 2025-01-02

cline/cline: Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way. 2025-01-02

OpenDriveLab/AgiBot-World: World’s First Large-scale High-quality Robotic Manipulation Benchmark 2025-01-02

huggingface/smolagents: 🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents. 2025-01-02

TMElyralab/MusePose: MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation 2025-01-02

SpatialVision/Orient-Anything 2025-01-02

ixarchakos/try-off-anyone: Official repository of “TryOffAnyone: Tiled Cloth Generation from a Dressed Person” 2025-01-02

Hugging Face

MMAudio — generating synchronized audio from video/text - a Hugging Face Space by hkchengrex 2025-01-02

Anychat - a Hugging Face Space by akhaliq 2025-01-02

FacePoke - a Hugging Face Space by jbilcke-hf 2025-01-02

AI Comic Factory - a Hugging Face Space by jbilcke-hf 2025-01-02

Switti - a Hugging Face Space by dbaranchuk 2025-01-02

Dokdo Multimodal - a Hugging Face Space by ginipick 2025-01-02

Dokdo - a Hugging Face Space by ginigen 2025-01-02

等待中的项目

Feat2GS 2025-01-02

GenHMR: Generative Human Mesh Recovery 2025-01-02

1.58-bit FLUX 2025-01-02

HSfM 2025-01-02

PERSE: Personalized 3D Generative Avatars from A Single Portrait 2025-01-02

VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control 2025-01-02

项目官方网站

Project Odyssey 2024-12-30

在线运行 ComfyUI 工作流并一键部署 API - ComfyOnline 2024-12-30

智谱AI开放平台 2024-12-30

Replit – Build apps and sites with AI 2024-12-30

AIGCPanel | 开源AI数字人系统 2024-12-30

阶跃星辰开放平台 2024-12-30

Fireworks - Fastest Inference for Generative AI 2024-12-30

百川大模型-汇聚世界知识创作妙笔生花-百川智能 2024-12-30

DomoAI | AI Art Generator & Video to Animation Converter 2024-12-30

CreateAI 2024-12-30

Magnific AI — The magic image Upscaler & Enhancer 2024-12-30

Odyssey 2024-12-30

Nexa AI | Enterprise-Grade On-Device AI for Every Device 2024-12-30

Humane Ai Pin | See the World, Not Your Screen. | Humane 2024-12-30

书生 2024-12-30

Taipy — Build Python Data & BI web applications 2024-12-30

GitHub 项目

facebookresearch/blt: Code for BLT research paper 2024-12-30

VideoVerses/VideoVAEPlus 2024-12-30

TencentARC/DI-PCG: Code release of our paper “DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation”. 2024-12-30

QwenLM/Qwen2-VL: Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. 2024-12-30

web-infra-dev/midscene: An AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language. 2024-12-30

hpcaitech/Open-Sora: Open-Sora: Democratizing Efficient Video Production for All 2024-12-30

osanseviero/geminiCoder: Create apps with Gemini 2024-12-30

IamCreateAI/Ruyi-Models 2024-12-30

rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in PyTorch from scratch, step by step 2024-12-30

getmaxun/maxun: 🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta] 2024-12-30

SakanaAI/asal: Automating the Search for Artificial Life with Foundation Models! 2024-12-30

fallenshock/FlowEdit: Official implementation of the paper: “FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models” 2024-12-30

mingyuan-zhang/LMM: Large Motion Model for Unified Multi-Modal Motion Generation 2024-12-30

TencentARC/StereoCrafter: A framework to convert any 2D videos to immersive stereoscopic 3D 2024-12-30

THUDM/CogAgent: An open-sourced end-to-end VLM-based GUI Agent 2024-12-30

AriaUI/Aria-UI: Aria-UI: Visual Grounding for GUI Instructions 2024-12-30

modstart-lib/aigcpanel: AigcPanel 是一个简单易用的一站式AI数字人系统，支持视频合成、声音合成、声音克隆，简化本地模型管理、一键导入和使用AI模型。 2024-12-30

krystalan/DRT-o1: DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought 2024-12-30

zsyOAOA/InvSR: Arbitrary-steps Image Super-resolution via Diffusion Inversion 2024-12-30

livekit/agents: Build real-time multimodal AI applications 🤖🎙️📹 2024-12-30

modelscope/ClearerVoice-Studio: An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc. 2024-12-30

baaivision/See3D: You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale 2024-12-30

Nutlope/picMenu: Visualize menus in seconds with AI 2024-12-30

Avaiga/taipy: Turns Data and AI algorithms into production-ready web applications in no time. 2024-12-30

Hugging Face

QVQ 72B Preview - a Hugging Face Space by Qwen 2024-12-30

LuminaBrush - a Hugging Face Space by lllyasviel 2024-12-30

InvSR - a Hugging Face Space by OAOA 2024-12-30

ClearerVoice-Studio (Speech Enhancement, Separation and Extraction) - a Hugging Face Space by alibabasglab 2024-12-30

等待中的项目

Lifting Motion to the 3D World via 2D Diffusion 2024-12-30

Synthesizing Moving People with 3D Control 2024-12-30

pkulwj1994/diff_instruct_pp: We introduce Diff-Instruct++, a novel approach for human preference alignment of 1-step text-to-image generation. 2024-12-30

MegaSaM 2024-12-30

Sketch2Sound 2024-12-30

INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations 2024-12-30

From Slow Bidirectional to Fast Causal Video Generators 2024-12-30

项目官方网站

Zenodo 2024-12-20

Whisk 2024-12-20

labs.google/fx 2024-12-20

无问芯穹一站式AI平台 2024-12-20

VideoLingo - AI Subtitles Translation 2024-12-20

GitHub 项目

RedAIGC/Flux-version-LayerDiffuse 2024-12-20

microsoft/markitdown: Python tool for converting files and office documents to Markdown. 2024-12-20

franciszzj/Leffa: Learning Flow Fields in Attention for Controllable Person Image Generation 2024-12-20

wzhouxiff/ObjCtrl-2.5D: ObjCtrl-2.5D 2024-12-20

ali-vilab/FreeScale: Code for FreeScale, a tuning-free method for higher-resolution visual generation 2024-12-20

tumurzakov/AnimateDiff: AnimationDiff with train 2024-12-20

hkchengrex/MMAudio: [arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis 2024-12-20

TencentARC/BrushEdit: The official implementation of paper “BrushEdit: All-In-One Image Inpainting and Editing” 2024-12-20

TencentARC/ColorFlow: The official implementation of paper “ColorFlow: Retrieval-Augmented Image Sequence Colorization” 2024-12-20

IamCreateAI/Ruyi-Models 2024-12-20

Genesis-Embodied-AI/Genesis: A generative world for general-purpose robotics & embodied AI learning. 2024-12-20

Kedreamix/Linly-Dubbing: 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能，语言无界” 2024-12-20

genmoai/mochi: The best OSS video generation models 2024-12-20

guoyww/AnimateDiff: Official implementation of AnimateDiff. 2024-12-20

Hugging Face

BrushEdit - a Hugging Face Space by TencentARC 2024-12-20

TRELLIS - a Hugging Face Space by JeffreyXiang 2024-12-20

等待中的项目

Motion Prompting: Controlling Video Generation with Motion Trajectories 2024-12-20

snap-research.github.io/wonderland/ 2024-12-20

X-Portrait 2: Highly Expressive Portrait Animation 2024-12-20

项目官方网站

New Chat | glhf.chat 2024-12-15

edify-3d Model by Shutterstock | NVIDIA NIM 2024-12-15

豆包 MarsCode - 工作台 2024-12-15

Devin 2024-12-15

DeepSeek - 探索未至之境 2024-12-15

Sora 2024-12-15

DeepLearning.AI - Learning Platform 2024-12-15

D5渲染器官网 | 实时光追渲染技术，重塑3D创作工作流 2024-12-15

PromptPerfect - AI Prompt Generator and Optimizer 2024-12-15

Learn Prompting: Your Guide to Communicating with AI 2024-12-15

GitHub 项目

hacksider/Deep-Live-Cam: real time face swap and one-click video deepfake with only a single image 2024-12-15

datawhalechina/llm-cookbook: 面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版 2024-12-15

f/awesome-chatgpt-prompts: This repo includes ChatGPT prompt curation to use ChatGPT better. 2024-12-15

Stability-AI/stable-audio-tools: Generative models for conditional audio generation 2024-12-15

isarandi/nlf: [NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation 2024-12-15

Stability-AI/stable-fast-3d: SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement 2024-12-15

lihxxx/DisPose: This repository is the official implementation of DisPose 2024-12-15

fkryan/gazelle 2024-12-15

tdrussell/diffusion-pipe: A pipeline parallel training script for diffusion models. 2024-12-15

openai/openai-cookbook: Examples and guides for using the OpenAI API 2024-12-15

promptslab/Awesome-Prompt-Engineering: This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc 2024-12-15

thunlp/Delta-CoMe: Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024 2024-12-15

Hugging Face

FlowEdit - a Hugging Face Space by fallenshock 2024-12-15

等待中的项目

Project Astra - Google DeepMind 2024-12-15

Project Mariner - Google DeepMind 2024-12-15

Jules (Confidential) 2024-12-15

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance 2024-12-15

SwiftEdit 2024-12-15

Michael Fischer 2024-12-15

Using Diffusion Priors for Video Amodal Segmentation 2024-12-15

项目官方网站

Fish Audio: Free Generative AI Text To Speech & Voice Cloning 2024-12-09

Generative Foundation Model - Amazon Nova - AWS 2024-12-09

RunComfy: Top ComfyUI Platform - Fast & Easy, No Setup 2024-12-09

提示工程指南 | Prompt Engineering Guide 2024-12-09

Prompt Engineering Guide | Prompt Engineering Guide 2024-12-09

Hailuo AI Audio: Create lifelike speech 2024-12-09

GitHub 项目

FunAudioLLM/CosyVoice: Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. 2024-12-09

FunAudioLLM/SenseVoice: Multilingual Voice Understanding Model 2024-12-09

modelscope/FunASR: A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. 2024-12-09

yformer/EfficientTAM: Efficient Track Anything 2024-12-09

jingyaogong/minimind: 「大模型」3小时完全从0训练26M的小参数GPT，个人显卡即可推理训练！ 2024-12-09

kijai/ComfyUI-HunyuanVideoWrapper 2024-12-09

jianchang512/clone-voice: A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频 2024-12-09

dair-ai/Prompt-Engineering-Guide: 🐙 Guides, papers, lecture, notebooks and resources for prompt engineering 2024-12-09

memoavatar/memo: Memory-Guided Diffusion for Expressive Talking Video Generation 2024-12-09

1jsingh/negtome: Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance 2024-12-09

microsoft/TRELLIS: Official repo for paper “Structured 3D Latents for Scalable and Versatile 3D Generation”. 2024-12-09

Francis-Rings/StableAnimator: We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses. 2024-12-09

Hugging Face

CosyVoice-300M · 创空间 2024-12-09

ChatTTS Speaker - a Hugging Face Space by taa 2024-12-09

Flux Fill Outpainting - a Hugging Face Space by multimodalart 2024-12-09

Flux.1-dev Upscaler - a Hugging Face Space by jasperai 2024-12-09

Flux.1-dev Upscaler - a Hugging Face Space by Nymbo 2024-12-09

等待中的项目

Muse 2024-12-09

Introducing Veo and Imagen 3 on Vertex AI | Google Cloud Blog 2024-12-09

FLOAT 2024-12-09

Genie 2: A large-scale foundation world model - Google DeepMind 2024-12-09

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters 2024-12-09

Digital Life Project 2024-12-09

I2VControl: Disentangled and Unified Video Motion Synthesis Control 2024-12-09

DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction 2024-12-09

fugatto.github.io 2024-12-09

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models 2024-12-09

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance 2024-12-09

vision-xl 2024-12-09