- Georgia Institute of Technology
- United States
- https://cocoxu.github.io/
- @cocoweixu
Stars
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
Repository for our paper “Subverting the Jewtocracy”: Online Antisemitism Detection Using Multimodal Deep Learning
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
Source code for Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains (ACL 2021).
A beautiful, simple, clean, and responsive Jekyll theme for academics
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
A collection of Farsi (Persian) datasets
Scripts to preprocess training and test data and to run fast_align and giza
Image/text geolocation with TensorFlow and the MvMF
Neural CRF Model for Sentence Alignment in Text Simplification
A graphical interface for gold-standard corpus annotation of alignment and paraphrasing
A sentiment classifier tool and library trained on Twitter data
Extracting useful metadata from Wikipedia dumps in any language.
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
BERT NLP papers, applications, and GitHub resources, including the newest XLNet; BERT- and XLNet-related papers and GitHub projects
Code for NAACL 2018 paper "Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate Label Spaces" by Isabelle Augenstein, Sebastian Ruder, Anders Søgaard
Tutorial on computational models of language change
Software for writing protocols and running them on the Opentrons Flex and Opentrons OT-2
BiLSTM-CNN-CRF architecture for sequence tagging using ELMo representations.
Files for the Python lecture I give at IA-UNAM
IPython Notebooks to learn Python