Turns webpages into LLM-friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, and image and webpage link extraction easy.
Updated Nov 26, 2025 - Python
This project provides a powerful web scraping tool that fetches search results and converts them into Markdown format using FastAPI, SearXNG, and Browserless. It includes the capability to use proxies for web scraping and handles HTML content conversion to Markdown efficiently.
An assistant for Slack built with Arcade and LangGraph. Interact with Google Calendar, Mail, GitHub, search engines, Firecrawl, and more, all from within Slack.
Open-source alternative to Perplexity Comet, director.ai, and Firecrawl combined
CrewNews is an AI news generator that delivers an unbiased version of the news for a given topic, using Streamlit for the GUI, Llama 3.1 as the LLM (inferenced via the AIML API), CrewAI for building AI agents, AgentOps for testing AI agents, and Exa & Firecrawl as tools
AI Lead Generation Agent that automatically discovers and qualifies potential leads from Quora. Using Firecrawl for intelligent web scraping, Phidata for agent orchestration, and Composio for Google Sheets integration, you'll create a system that can continuously generate and organize qualified leads with minimal human intervention!
Just mention what you want and it will extract/scrape data from the web. Useful for creating AI web search and extraction/scraping agents, RAG with web data, etc.
This repository demonstrates how to leverage OpenAI's GPT-4 models with JSON Strict Mode to extract structured data from web pages. It combines web scraping capabilities from Firecrawl with OpenAI's advanced language models to create a powerful data extraction pipeline.
DRIA (Deep Research and Intelligence Agent) is a fully local voice assistant that can hold real-time conversations while performing deep research in the background — powered by Firecrawl, Mistral AI, Perplexica, and LiveKit.
AI-powered web scraping agent built with LangGraph, LangSmith, Firecrawl, and Anthropic AI. Automates intelligent crawling, structured data extraction, and LLM-powered content formatting. Efficiently handles anti-bot mechanisms, error recovery, and batch processing. 🚀
🧳 A state-of-the-art multi-agent travel planning system powered by OpenAI Agents SDK and LangGraph orchestration. Leverages Stagehand/Playwright for browser automation, Supabase for data persistence, and Firecrawl/Tavily for intelligent research.
Retrieval-augmented docs ingestion stack: Firecrawl + Crawl4AI + Qdrant vector search with FastAPI and MCP interfaces for AI engineers.
A collection of agents built on top of LangChain, LangGraph, MCP, and many other tools
Implementing LangChain concepts and building meaningful projects
An AI-powered Market Intelligence Agent that analyzes live market data, extracts insights using LLMs, generates strategic recommendations, and visualizes trends with interactive charts — built with Streamlit, LangChain, Firecrawl, and Plotly.
A Python port of @dzhng's amazing deep-research project, an AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the simplest implementation of a deep research agent.
A utility to crawl and download all images and videos from Ghost blogs, organized by article slug for easy migration to self-hosted platforms.
Analyzing GitHub Issues Using GPT-5.1 Codex