-
Institute of automation, Chinese academy of science
- Haidian district ,Beijing, China
-
17:07
(UTC -12:00) - https://Casia_Dominic
Lists (15)
Sort Name ascending (A-Z)
3D Gaussian
3D Gaussian models for digital human4d Gen
符号模型表征Diffusion model for audio
speech & audio generationdiffusion model for eeg
Diffusion models for 3D
Diffusion models for images
Global Illumination Accalerating
tagc 腾讯游戏场景渲染加速,带tod时间通道LLM
Starred repositories
Stable Diffusion web UI
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
No fortress, purely open ground. OpenManus is Coming.
Instant voice cloning by MIT and MyShell. Audio foundation model.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
deep learning for image processing including classification and object-detection etc.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
The official GitHub page for the survey paper "A Survey of Large Language Models".
A collaboration friendly studio for NeRFs
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
⚡机器学习实战(Python3):kNN、决策树、贝叶斯、逻辑回归、SVM、线性回归、树回归
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
Sharp Monocular View Synthesis in Less Than a Second
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Denoising Diffusion Probabilistic Models
This patch removes restriction on maximum number of simultaneous NVENC video encoding sessions imposed by Nvidia to consumer-grade GPUs.
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
A series of large language models developed by Baichuan Intelligent Technology