-
@drvalue Hanyang University ERICA
- Republic of Korea / Seoul
-
04:39
(UTC +09:00) - https://rhya-network.com
- yeok_sihun._.4
- @AoiTkns
Highlights
Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monito…
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
Daemon to ban hosts that cause multiple authentication errors
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Automated All-in-One OS Command Injection Exploitation Tool.
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection" (AAAI 2022 Oral)
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.