Skip to content
View cjer's full-sized avatar

Organizations

@hasadna @WaiSystems @omilab @BIU-NLP @OnlpLab

Block or report cjer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Modeling, training, eval, and inference code for OLMo

Python 6,411 722 Updated Nov 24, 2025

A codebase for "Local Model-Agnostic Explanations for Ranking Model Interpretability"

Jupyter Notebook 4 1 Updated Jul 5, 2023

Hebrew oriented NER spaCy pipeline

Python 21 7 Updated Aug 8, 2024

Hebrew PHI identification and redaction toolkit

Python 20 6 Updated Mar 21, 2024

Neural Modeling for Named Entities and Morphology (Hebrew NER)

Python 33 11 Updated Dec 20, 2022

Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested mentions, and more.

11 3 Updated Dec 27, 2021

Papers on fairness in NLP

452 53 Updated May 2, 2024

A neural network layer that enables training of deep neural networks directly from crowdsourced labels (e.g. from Amazon Mechanical Turk) or, more generally, labels from multiple annotators with di…

Jupyter Notebook 69 20 Updated Dec 13, 2021

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

Python 321 53 Updated Mar 15, 2026

Tool for parsing and converting various span encoding schemes.

Python 23 2 Updated Jan 13, 2024

Pandas Network Analysis by UrbanSim: fast accessibility metrics and shortest paths, using contraction hierarchies 🗺️

C++ 417 98 Updated Nov 25, 2023

A tool for GTFS transit and OSM pedestrian network accessibility analysis by UrbanSim

Python 259 62 Updated May 11, 2023

Tools for the extraction of OpenStreetMap street network data

Python 62 18 Updated Aug 8, 2023

Human annotations for "Inherent Disagreements in Human Textual Inferences" paper

9 1 Updated Sep 3, 2019

A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word expression extraction.

Python 23 3 Updated Aug 13, 2022
Jupyter Notebook 5 1 Updated Aug 3, 2019

Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual support)

Java 48 9 Updated Aug 14, 2023

A node.js port to the JavaScriptCore engine and iOS

JavaScript 223 16 Updated Nov 25, 2022

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

Python 3,085 426 Updated Jan 12, 2024

🚌 Analysing Israel's public transport data

Java 111 28 Updated Dec 9, 2022

GTFS ORM using SQLAlchemy

Python 177 44 Updated Jan 12, 2026

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Scala 157 19 Updated Oct 8, 2025

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Scala 153 33 Updated Dec 5, 2025

peartree: A library for converting transit data into a directed graph for sketch network analysis.

Python 207 26 Updated May 5, 2023

Collects and parses price data from Israeli supermarkets.

Go 8 3 Updated Jan 22, 2018

A fast, forgiving GTFS reader built on pandas DataFrames

Python 182 23 Updated Dec 3, 2023