Skip to content
View mnmkng's full-sized avatar

Organizations

@apify

Block or report mnmkng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 4,376 302 Updated Nov 8, 2024

The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply moni…

Python 18,850 1,029 Updated Nov 7, 2024

I Don't Care About Cookies extension compiled for use with Playwright/Puppeteer

JavaScript 9 Updated Sep 9, 2024

Drag & drop UI to build your customized LLM flow

TypeScript 31,294 16,294 Updated Nov 8, 2024

A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain

Jupyter Notebook 3,451 732 Updated Mar 1, 2024

A standalone version of the readability lib

JavaScript 8,968 607 Updated Oct 17, 2024

The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.

Python 120 11 Updated Nov 8, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 49,976 4,843 Updated Oct 28, 2024

🚀 Fast and simple Node.js version manager, built in Rust

Rust 18,174 464 Updated Nov 8, 2024

The best way to write secure and reliable applications. Write nothing; deploy nowhere.

Dockerfile 60,847 4,716 Updated Aug 7, 2024

estela, an elastic web scraping cluster 🕸

TypeScript 172 13 Updated Oct 29, 2024

A JavaScript library for generating random user agents with data that's updated daily.

TypeScript 982 51 Updated Nov 8, 2024

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 15,498 665 Updated Nov 8, 2024

Apify SDK monorepo

TypeScript 123 35 Updated Nov 8, 2024

A comment system powered by GitHub Discussions. :octocat: 💬 💎

TypeScript 8,464 347 Updated Nov 2, 2024

Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

TypeScript 959 101 Updated Nov 4, 2024

Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

JavaScript 849 143 Updated Oct 15, 2024

Easy to maintain open source documentation websites.

TypeScript 56,605 8,498 Updated Nov 8, 2024

🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.

Python 33,862 3,677 Updated Nov 1, 2024

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

TypeScript 89 14 Updated Nov 14, 2022

Quick Look extension for highlight source code files on macOS 10.15 and later.

C++ 2,967 71 Updated Sep 24, 2024

macOS Quick Look extension for Markdown files.

C++ 1,349 32 Updated Sep 2, 2024

A macOS app for customizing which browser to start

Swift 3,729 137 Updated Sep 1, 2024

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

JavaScript 4,128 226 Updated Jul 17, 2024

Apify API client for Python

Python 47 11 Updated Nov 8, 2024

HTTP client made for scraping based on got.

TypeScript 550 43 Updated Oct 23, 2024

Use HTTP/2 the same way like HTTP/1

JavaScript 239 18 Updated May 7, 2024

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

TypeScript 66,804 3,664 Updated Nov 8, 2024

The web scraper that's nearly impossible to block - now called @ulixee/hero

TypeScript 671 45 Updated Mar 7, 2023

House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.

JavaScript 117 44 Updated Apr 13, 2023
Next