Stars
Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
KoWit-24: A Dataset with Fine-Grained Annotations of Wordplay
Multilingual dataset for principal parts detection in inflectional morphology (CoNLL 2025)
High-performance vector similarity library in Rust with Python bindings: Spearman, Kendall, distance correlation, Jensen-Shannon, Hoeffding's D, and bootstrapped confidence intervals
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
This is for C2D2 Dataset: A Resource for Analyzing Cognitive Distortions and Its Impact on Mental Health
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Data for "Identification of Disease or Symptom terms in Reddit to Improve Health Mention Classification" paper accepted at The Web Conference '22
A collection of datasets for depression detection/ modelling from social media data
The unofficial python package that returns response of Google Bard through cookie value.
This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.
Mining Legal Arguments in Court Decisions - Data and software
A Spacy Package for Romanian Legal Document Processing
Awesome-LLM: a curated list of Large Language Model
Fast & Simple repository for pre-training and fine-tuning T5-style models
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
A clinical BERT-based NLP tool for parsing clinical trial abstracts following the PICO framework
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Datasets, sentiment analysis model, and results to support the paper "SART & COVIDSentiRo: Datasets for Sentiment Analysis Applied to Analyzing COVID-19 Vaccination Perception in Romanian Tweets"
Scripts and data from the paper "Ceolin, A., Guardiano, C., Longobardi, G., Irimia, M. A., Bortolussi L., & Sgarro A. (2021). At the Boundaries of Syntactic Prehistory. Philosophical Transactions o…