-
Hugging Face
- Paris, France
- https://linkedin.com/in/anton-lozhkov/
- @anton_lozhkov
Stars
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
A course on aligning smol models.
Generate images from texts. In Russian
Fast, general, and tested differentiable structured prediction in PyTorch
Diffusion attentive attribution maps for interpreting Stable Diffusion.
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
Compute Sentence Embeddings Fast!
(ECCV 2020) RANSAC-Flow: generic two-stage image alignment
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
computational zoom from raw sensor data
HF's ML for Audio study group
A pytorch &keras implementation and demo of Fastformer.
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Tools for shrinking fastText models (in gensim format)
Code for the curation of The Stack v2 and StarCoder2 training data
📝 Utility to create, edit, and publish model cards on the Hugging Face Hub. [**Now lives in huggingface_hub**]