☕
Drinking coffee
I'm a Principal Research Scientist at the Common Crawl Foundation. Weird coffee person and slow marathon runner.
- Paris
- UTC +02:00
- https://portizs.eu
- https://orcid.org/0000-0003-0343-8852
- @pjox13
- @pjox@mastodon.social
- @pjox.bsky.social
Highlights
- Pro
Stars
8 stars written in Java
A configuration as code language with rich validation and tooling.
A machine learning software for extracting information from scholarly documents
A machine learning tool for fishing entities
The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)
commoncrawl / nutch
Forked from apache/nutch
Common Crawl fork of Apache Nutch
Analytic platform for the HAL research archive (in development)
Simple Java client for GROBID REST services