Stars
newspaper3k: news, full-text, and article metadata extraction in Python 3.
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
⚡ HugoBlox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇
extract text from any document. no muss. no fuss.
Twitter Text Libraries. This code is used at Twitter to tokenize and parse text to meet the expectations for what can be used on the platform.
A frictionless, pipeable approach to dealing with summary statistics
Beautiful and customizable model summaries in R.
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html
Open source project for data preparation for GenAI applications
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Find dates inside text using Python and get back datetime objects
Mixed-effects models in R using S4 classes and methods with RcppEigen
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
A repository to monitor attack vectors from state-backed information operations
Summer Institutes in Computational Social Science
The File System State Monitor keeps track of the state of any number of paths and will fire events when said state changes (create/update/delete). FSSM supports using FSEvents on macOS, Inotify on …
An R package for the extraction of sentiment and sentiment-based plot arcs from text
Static and dynamic network visualization with R - code and tutorial from Sunbelt 2019 workshop.
Everyday things people use in PyTorch. No need to spend hours reading PyTorch forums trying to find them!
An open source online platform for collaborative image labeling
The Internet Monitor is a research project to evaluate, describe, and summarize the means, mechanisms, and extent of Internet content controls and Internet activity around the world.
A collection of R packages spanning natural language processing, statistical analysis, data visualization, and text analysis
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Intro to Machine Learning with the Tidyverse
Code and Resources for "Applied Machine Learning"