Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation
Shaocong Xu, Songlin Wei, Qizhe Wei, Zheng Geng, Hong Li, Licheng Shen, Qianpu Sun, Shu Han, Bin Ma, Bohan Li, Chongjie Ye, Yuhang Zheng, Nan Wang, Saining Zhang, and Hao Zhao
DKT is a foundation model for transparent-object 🫙, in-the-wild 🌎, arbitrary-length ⏳ video depth and normal estimation, enabling downstream applications such as robot manipulation and policy learning.
[25-12-04] 🔥🔥🔥 DKT is now released. Have fun!
Our pretrained models are available on the Hugging Face Hub:
| Version | Hugging Face Model |
|---|---|
| DKT-Depth-1-3B | [Daniellesry/DKT-Depth-1-3B](https://huggingface.co/Daniellesry/DKT-Depth-1-3B) |
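If you prefer to fetch the weights ahead of time (for example, to cache them on a shared drive), here is a minimal sketch using the standard `huggingface_hub` API. Whether `DKTPipeline` can be pointed at a local checkpoint directory is our assumption, so check the pipeline's constructor arguments.

```python
from huggingface_hub import snapshot_download

# Download the DKT-Depth-1-3B weights into the local Hugging Face cache
# and print the resulting directory. Pointing DKTPipeline at this path is
# an assumption; verify against the pipeline's constructor.
local_dir = snapshot_download(repo_id="Daniellesry/DKT-Depth-1-3B")
print(local_dir)
```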
Please run the following commands to set up the package:

```bash
git clone https://github.com/Daniellli/DKT.git
cd DKT
pip install -r requirements.txt
```
- Online demo: DKT
- Local demo:

  ```bash
  python app.py
  ```
- Inference with the Python API:

  ```python
  import os

  from dkt.pipelines.pipelines import DKTPipeline
  from tools.common_utils import save_video

  # Build the DKT pipeline.
  pipe = DKTPipeline()

  # Run depth estimation on an example video.
  demo_path = 'examples/1.mp4'
  prediction = pipe(demo_path)

  # Save the colorized depth video.
  save_dir = 'logs'
  os.makedirs(save_dir, exist_ok=True)
  output_path = os.path.join(save_dir, 'demo.mp4')
  save_video(prediction['colored_depth_map'], output_path, fps=25)
  ```
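To run the model over a whole folder of clips, the sketch below batches the single-video call above. It only reuses the names shown in this README (`DKTPipeline`, `save_video`, and the `'colored_depth_map'` key) and assumes a pipeline instance can be reused across calls.

```python
import os

from dkt.pipelines.pipelines import DKTPipeline
from tools.common_utils import save_video

# Build the pipeline once and reuse it for every clip,
# so the weights are only loaded a single time (assumed safe).
pipe = DKTPipeline()

video_dir = 'examples'
save_dir = 'logs'
os.makedirs(save_dir, exist_ok=True)

for name in sorted(os.listdir(video_dir)):
    if not name.endswith('.mp4'):
        continue
    prediction = pipe(os.path.join(video_dir, name))
    stem = os.path.splitext(name)[0]
    # Save the colorized depth video for this clip.
    save_video(prediction['colored_depth_map'],
               os.path.join(save_dir, f'{stem}_depth.mp4'), fps=25)
```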
Our code builds on recent fantastic works, including MoGe, WAN, and DiffSynth-Studio. We sincerely thank the authors for their excellent contributions!
...