-
CyberAgent, Inc.
- Nagoya, Japan
-
01:39
(UTC +09:00) - https://orcid.org/0000-0002-6163-6251
- @PINTO03091
- https://zenn.dev/pinto0309
- https://qiita.com/PINTO
Sponsors
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
- All languages
- ASP.NET
- ApacheConf
- Assembly
- BitBake
- C
- C#
- C++
- CMake
- CSS
- Cuda
- Cython
- Dart
- Dockerfile
- GLSL
- Gnuplot
- Go
- Groovy
- HTML
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Lua
- MATLAB
- MDX
- MLIR
- OCaml
- PHP
- PowerShell
- Processing
- Python
- Roff
- Ruby
- Rust
- ShaderLab
- Shell
- Starlark
- Swift
- TeX
- TypeScript
- Verilog
Towards End-to-end Video-based Eye-tracking. ECCV 2020. https://ait.ethz.ch/eve
Official implementation of the paper "Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding" (CVPR 2023)
Repository for the paper "Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop"
Official code for MAMMA: Markerless Accurate Multi-person Motion Acquisition.
a *fast* image-data annotator leveraging foundation models for automation
LibreYOLO is a MIT licensed open source computer vision library
TensorRT Engine for SAM-3 model by Meta AI
Eagle: Frontier Vision-Language Models with Data-Centric Strategies
Code used to produce the results of the paper "BiternionNets: Continuous Head Pose Regression from Discrete Training Labels"
An implementation of BiternionNets for ROS, ready to run on a robot.
SAM 3D bodyを用いて画像から3D人形(Blender用のメッシュとボーン▶FBX、ClipStudioPaint用のポーズデータ▶BVH)を取り出すツールです
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
HumanNet: Scaling Human-centric Video Learning to One Million Hours
RynnBrain: Open Embodied Foundation Models
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Cross-platform supply-chain guard for CI: supervised-run audit/block (eBPF/ETW) + minimum-release-age proxy & lockfile check for npm, cargo, PyPI, NuGet.
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
Code of paper "A Doubly Decoupled Network for Edge Detection"
Halpe: full body human pose estimation and human-object interaction detection dataset
Template matching for rotation using Radon transforms.
🔥 [ICCV 2025 Highlight] Official open-source repo for LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition
[ICCV 2023] TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective
Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.
Gemini Live provides multimodal realtime agent capabilities. Build voice agents that can process vision and text in realtime.