Amit Chaudhary
  • About
  • Notes
  • Toolbox
Categories
All (40)
agents (1)
colab (2)
data-augmentation (3)
embeddings (3)
evals (3)
function-calling (1)
information-retrieval (1)
linux (2)
llm (3)
maths (1)
misc (1)
nlp (18)
python (7)
pytorch (1)
rag (1)
self-supervised-learning (7)
semi-supervised-learning (2)
synthetic-data (1)
zero-shot-learning (2)

The Anatomy of Tool Calling

function-calling
agents
python

Learn how to use Python’s runtime introspection to dynamically provide tool definitions to LLMs

Feb 15, 2025
8 min

Evals for Diversity in Synthetic Data

synthetic-data
evals
llm

An overview of evaluation metrics for measuring linguistic diversity in LLM-generated synthetic data

Feb 9, 2025
12 min

Zero-Cost Custom Feeds on Bluesky

misc

A simple stack for generating custom feeds for Bluesky programmatically without a backend server

Dec 1, 2024
12 min

Parallel Processing with tqdm

python

A dead-simple way to perform parallel processing with progress bars natively in tqdm

Oct 20, 2024
4 min
Example of a regular expression

A Visual Guide to Regular Expression

python
nlp

A mental model of how various components of a regular expression work from the bottom-up.

Oct 21, 2020
8 min

Knowledge Transfer in Self Supervised Learning

self-supervised-learning

A general framework to transfer knowledge from deep self-supervised models to shallow task-specific models

Oct 4, 2020
11 min

Interactive Analysis of Sentence Embeddings

nlp
embeddings

Learn how to interactively explore sentence embeddings and labels in Tensorflow Embedding Projector

Sep 24, 2020
6 min

VSCode on Google Colab

colab

Learn how to setup and use VSCode as an IDE on Google Colab and Kaggle.

Sep 1, 2020
5 min

Unsupervised Keyphrase Extraction

nlp

Learn about unsupervised algorithms for automatically extracting representative keyword and phrases from documents

Aug 30, 2020
16 min

Text Data Augmentation with MarianMT

nlp
data-augmentation

Learn how to use machine translation models in Hugging Face Transformers for data augmentation

Aug 30, 2020
5 min

Evaluation Metrics For Information Retrieval

information-retrieval
evals
rag

Learn about common metrics used to evaluate performance of information retrieval systems

Aug 4, 2020
22 min

Behavioral Testing of NLP models

nlp
evals

An overview of the “CheckList” framework for fine-grained evaluation of NLP models

Jul 28, 2020
59 min

Semi-Supervised Learning in Computer Vision

semi-supervised-learning

A comprehensive overview of recent semi-supervised learning methods in Computer Vision

Jul 12, 2020
14 min

FastAPI for Flask Users

python

A comprehensive guide to FastAPI with a side-by-side code comparison with Flask

Jun 29, 2020
10 min

Google Colab Tips for Power Users

colab

Learn about lesser-known features in Google Colaboratory to improve your productivity.

Jun 26, 2020
12 min

A Visual Guide to FastText Word Embeddings

nlp
embeddings

A deep-dive into how FastText enriches word vectors with subword information

Jun 21, 2020
15 min

Universal Sentence Encoder Visually Explained

nlp
embeddings

A deep-dive into how Universal Sentence Encoder learns to generate fixed-length sentence embeddings

Jun 15, 2020
9 min

Zero-shot Text Classification With Generative Language Models

nlp
zero-shot-learning
llm

A text generation approach to zero-shot text classification with GPT-2

Jun 7, 2020
21 min

Exploring Knowledge Captured in Probability of Strings

nlp
llm

An exploration of simple knowledge captured by language models

Jun 7, 2020
6 min

Zero Shot Learning for Text Classification

nlp
zero-shot-learning

A summary of “Train Once, Test Anywhere” paper for zero-shot text classification

May 30, 2020
6 min

Self Supervised Representation Learning in NLP

nlp
self-supervised-learning

An overview of self-supervised pretext tasks in Natural Language Processing

May 23, 2020
7 min

A Visual Survey of Data Augmentation in NLP

nlp
data-augmentation

An extensive overview of text data augmentation techniques for Natural Language Processing

May 16, 2020
12 min

A Commit History of BERT and its Forks

nlp

What a commit history of version-controlled research papers could look like?

May 9, 2020
5 min

A Visual Guide to Recurrent Layers in Keras

nlp

Understand how to use Recurrent Layers like RNN, GRU, and LSTM in Keras with diagrams

Apr 23, 2020
5 min

A Visual Exploration of DeepCluster

self-supervised-learning

DeepCluster is a self-supervised method to combine clustering and representation learning

Apr 14, 2020
9 min

A Visual Guide to Self-Labelling Images

self-supervised-learning

A self-supervised method to generate labels via simultaneous clustering and representation learning

Apr 10, 2020
11 min

The Illustrated FixMatch for Semi-Supervised Learning

semi-supervised-learning

Learn how to leverage unlabeled data using FixMatch for semi-supervised learning

Mar 31, 2020
14 min

The Python Magic Behind PyTorch

python
pytorch

Learn about the advanced python native features behind PyTorch

Mar 23, 2020
8 min

The Illustrated PIRL: Pretext-Invariant Representation Learning

self-supervised-learning

Learn how PIRL generates image representations invariant to transformation in a self-supervised manner

Mar 16, 2020
8 min

The Illustrated SimCLR Framework

self-supervised-learning

A visual guide to the SimCLR framework for contrastive learning of visual representations.

Mar 4, 2020
8 min

The Illustrated Self-Supervised Learning

self-supervised-learning

A visual introduction to self-supervised learning methods for visual representations.

Feb 25, 2020
9 min

Back Translation for Text Augmentation with Google Sheets

nlp
data-augmentation

Learn how to augment existing labeled text data for free using Google Sheets.

Feb 19, 2020
3 min

A Visual Guide to ALBERT (A Lite BERT)

nlp

An illustrated summary of the ALBERT paper

Feb 8, 2020
8 min

Transfer Learning in NLP with Tensorflow Hub and Keras

nlp

Learn how to integrate and finetune tensorflow-hub modules in Tensorflow 2.0

Feb 2, 2020
6 min

Migrating from OS.PATH to PATHLIB Module in Python

python

Learn how to use the modern pathlib module to perform tasks you have been using os.path for

Dec 29, 2019
2 min

Math Symbols Explained with Python

maths

Learn the meaning behind mathematical symbols used in Machine Learning using your knowledge of Python.

Aug 3, 2019
5 min

Language Detection in Python

nlp

Learn how to detect the language of a given piece of text using Natural Language Processing.

Jul 15, 2019
6 min

How to Automate Manual Steps after SSH

linux

Learn how to automate repetitive commands after connecting to a SSH server

Mar 31, 2019
2 min

Django ORM if you already know SQL

python

Learn how to use Django ORM by drawing analogies to equivalent SQL statements

Oct 30, 2018
4 min

Shutdown Ubuntu With A Keyboard Shortcut

linux
In Windows, we can use Alt+F4 keyboard shortcut to shutdown. But Linux doesn’t have such feature out of the box. After switching to Ubuntu, I struggled trying to make a…
Jun 16, 2015
2 min
No matching items