hi! i'm aarushi singh

working on tech, safety and philosophy for humans and agents

building at the intersection of machine learning research and systems engineering. currently focused on multimodal llms, agentic infrastructure, and retrieval-augmented generation. previously interned at microsoft on the azure data spark native execution engine. open-source contributor to transformers, langchain, and more.

work

software engineer intern

microsoft · azure data

worked on the azure spark native execution engine (nee) using c++, scala, velox, gluten. integrated fuzz-testing pipelines, improved operator reliability, and enhanced ci/cd diagnostics for large-scale distributed sql execution.

jun–aug 2025

undergraduate ml researcher

bennett university

researched recommender systems and computer vision using pytorch and tensorflow. improved ndcg/mrr on matrix factorization models and benchmarked cnn/transformer architectures for emotion recognition on fer2013.

aug 2024–now

open source

huggingface / transformers

contributions to the core library and model architectures.

2 contributions

source

karpathy / nanochat

simple and clean web interface for local LLM assistants.

1 contribution

source

langchain-ai / langchain

building context-aware reasoning applications and agents.

1 contribution

source

databricks / dbt-databricks

the database tool (dbt) adapter for databricks runtimes.

3 contributions

source

stripe / ai

stripe integrations and SDK patterns for AI products.

1 contribution

source

projects

clipdb

privacy-first clipboard history manager with fuzzy search and aes-256 encryption. open-source.

pythoncliaes-256

source

distributed log aggregation

high-throughput log ingestion system handling 50k+ logs/sec using go concurrency and kafka.

gokafkapostgresqldocker

enterprise ai workflows

end-to-end workflow system integrating azure openai and ai search for enterprise-grade automation.

pythonfastapiazure openaireact

volatility surface modelling

generative models (gan/vae) to produce smooth volatility surfaces for option pricing.

pythonganvaequantlib

pegasus transformer fine-tuning

fine-tuned pegasus on aeslc for abstractive summarization in low-resource email domains.

pytorchtransformersnlp

supply chain optimization

optimal path algorithms using dijkstra's and custom regression models for yield prediction.

c++python

seq2seq summarization

sequence-to-sequence model using encoder-decoder transformers for document summarization.

pythonseq2seqtransformers

research

memory isolation in multi-agent llm systems

formulated memory interference as a failure mode in llm-based multi-agent systems, designing architectural variants for controlled retrieval-scoping.

independent research2025

llmfuzz-bench

designed a controlled evaluation framework to quantify output stability of llms under stochastic decoding conditions across 1,500+ tasks.

independent research2025

recommender systems & diversity

researching methods to improve diversity, relevance, and ranking stability in recommendation pipelines using llm-driven approaches.

bennett university2024 – ongoing

emotion recognition with cnns & transformers

benchmarked cnn and transformer architectures for emotion recognition on fer2013 dataset, analysing accuracy-efficiency tradeoffs.

computer vision2024

matrix factorization enhancements

improved ndcg/mrr metrics on matrix factorization models through novel regularization and training strategies.

collaborative filtering2024

blog

11 feb 2026

your brain already solved the problem ai agents need to work on

multi-agent memory architecture mirrors 500 million years of neural evolution. your brain doesn't blend kitchen and living room memories into "memory soup" — multi-agent ai systems are converging on the same solution.

multi-agent memory

read

12 jan 2026

should models always finish their sentences: the case for explicit hesitation

the industry has spent three years optimizing for fluency, creating the world's most confident liars. hallucination is a structural byproduct of our objective functions, not a bug to patch with more data.

llmhesitation

read

skills

c++pythongojavascalabashsqldockergitazure devopsci/cdkafkafastapiflaskpostgresqlmysqlpytorchtensorflowkerastransformershugging facelangchainllamascikit-learnnumpypandasopencvpytestlinuxjupyter