- Hong Kong
-
19:36
(UTC +08:00) - naozumi.me
- https://orcid.org/0009-0005-5105-8231
- https://huggingface.co/Naozumi0512
Highlights
Stars
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high eff…
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
π RuView: WiFi DensePose turns commodity WiFi signals into real-time human pose estimation, vital sign monitoring, and presence detection — all without a single pixel of video.
Build ultra fast, tiny, and cross-platform desktop apps with Typescript.
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
中文拼音打字练习工具,支持汉语拼音与粤语拼音,可切换简繁体显示并输入自定义文本。 A web-based Chinese typing practice tool supporting Mandarin Pinyin and Cantonese Jyutping.
Speaker embedding for anime speech domain based on ECAPA_TDNN
Create stunning demos for free. Open-source, no subscriptions, no watermarks, and free for commercial use. An alternative to Screen Studio.
Fully automatic censorship removal for language models
ModernBERT train from scratch in Cantonese
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image (CVPR 2026)
A modular and open-ended toolkit for WebGPU, with advanced type inference and the ability to write shaders in TypeScript
javascript-obfuscator plugin for next.js
Demonstration of running a native LLM on Android device.
NEO Series: Native Vision-Language Models from First Principles
A library to capture canvas-based animations at a fixed framerate
Universal Notation for Tensor Operations in Python
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
An open source DevOps tool from the CNCF for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…
A Framework for Speech, Language, Audio, Music Processing with Large Language Model
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Train transformer language models with reinforcement learning.
FastAPI + Playwright + Camoufox 中间层代理服务器,兼容OpenAI API且支持参数转发。项目通过浏览器自动化将API请求转发到 Google AI Studio 网页,并同样按照OpenAI标准格式返回的工具。内置调试WebUI面板。
A high-throughput and memory-efficient inference and serving engine for LLMs
✨ Reverse-engineered Python API for Google Gemini web app