Skip to content
View maulinniam's full-sized avatar

Block or report maulinniam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
69 stars written in Python
Clear filter

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 58,885 11,141 Updated Oct 27, 2025

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

Python 16,263 2,779 Updated Feb 23, 2023

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Python 8,841 1,580 Updated Jun 10, 2024

Community maintained fork of pdfminer - we fathom PDF

Python 6,778 1,010 Updated May 6, 2025

2025! X / Twitter API scrapper with authorization support. Allows you to scrape search results, User's profiles (followers/following), Tweets (favoriters/retweeters) and more.

Python 2,007 240 Updated Apr 29, 2025

Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf

Python 1,724 261 Updated Sep 29, 2021

A BERT model for scientific text.

Python 1,648 232 Updated Feb 22, 2022

A program for financial portfolio management, analysis and optimisation.

Python 1,646 220 Updated Nov 4, 2023

A neurosymbolic perspective on LLMs

Python 1,629 80 Updated Nov 6, 2025

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Python 1,250 152 Updated Jul 24, 2025

Data Provinsi, Kota/Kabupaten, Kecamatan, dan Kelurahan/Desa di Indonesia

Python 956 767 Updated Apr 11, 2022

Minimal python wrapper for Twitter's REST and Streaming APIs

Python 946 264 Updated Mar 23, 2023

Publication-quality network visualisations in python

Python 734 43 Updated Jan 21, 2025

The GitHub repository for the paper: “Time Series is a Special Sequence: Forecasting with Sample Convolution and Interaction“. (NeurIPS 2022)

Python 663 128 Updated Jul 12, 2023

Twelve Data Python Client - Financial data API & WebSocket

Python 539 81 Updated Apr 12, 2025

Python project for real-time financial data collection, analyzing && backtesting trading strategies

Python 435 149 Updated Oct 20, 2014

Indonesian stemmer. Python port of PHP Sastrawi project.

Python 348 112 Updated Jun 20, 2021

Analyze Data with Pandas-based Networks. Documentation:

Python 321 46 Updated Aug 20, 2025

A curated list of awesome Zotero resources

Python 308 6 Updated Oct 23, 2025

This tool should help discover different patterns based on similarity measures in historical (financial) data

Python 248 85 Updated Jun 20, 2023

Download U.S. census data and reformat it for humans

Python 225 37 Updated Sep 30, 2024

LitStudy: Using the power of Python to automate scientific literature analysis from the comfort of a Jupyter notebook

Python 205 61 Updated May 27, 2025

Template repository for data science lifecycle project

Python 198 62 Updated Jul 2, 2020

Computer-Assisted Reporting and Data Journalism Syllabuses, compiled by Dan Nguyen

Python 184 27 Updated Mar 3, 2021

A Bibliometric and Scientometric Python Library Powered with Artificial Intelligence Tools

Python 177 30 Updated Sep 10, 2025

Yet another paper reading assistant based on OpenAI ChatGPT API. An open-source version that attempts to reimplement ChatPDF. A different dialogue version of another ChatPaper project.

Python 177 16 Updated Jan 3, 2024

A Python library for doing bibliometric and network analysis in science and health policy research

Python 175 34 Updated Jun 9, 2022

High level script for finding tweets using Python 3 and Tweepy

Python 170 74 Updated Aug 28, 2021

Based on URL and Organization Name, collect the IP Ranges, subdomains using various tools like Amass, subfinder, etc.. And check for uphost and Run Masscan to grap CNAME entries, take the screensho…

Python 159 31 Updated May 1, 2024

An open-source utility to scrape Google Books

Python 107 24 Updated Jan 26, 2025
Next