hi! i'm aarushi singh

working on tech, safety and philosophy for humans and agents

building at the intersection of machine learning research and systems engineering. currently focused on multimodal llms, agentic infrastructure, and retrieval-augmented generation. previously interned at microsoft on the azure data spark native execution engine. open-source contributor to transformers, langchain, and more.

work

02

software engineer intern

microsoft · azure data

worked on the azure spark native execution engine (nee) using c++, scala, velox, gluten. integrated fuzz-testing pipelines, improved operator reliability, and enhanced ci/cd diagnostics for large-scale distributed sql execution.

jun–aug 2025

undergraduate ml researcher

bennett university

researched recommender systems and computer vision using pytorch and tensorflow. improved ndcg/mrr on matrix factorization models and benchmarked cnn/transformer architectures for emotion recognition on fer2013.

aug 2024–now

open source

05
huggingface / transformers

contributions to the core library and model architectures.

2 contributions
source
karpathy / nanochat

simple and clean web interface for local LLM assistants.

1 contribution
source
langchain-ai / langchain

building context-aware reasoning applications and agents.

1 contribution
source
databricks / dbt-databricks

the database tool (dbt) adapter for databricks runtimes.

3 contributions
source
stripe / ai

stripe integrations and SDK patterns for AI products.

1 contribution
source

projects

07
clipdb

privacy-first clipboard history manager with fuzzy search and aes-256 encryption. open-source.

pythoncliaes-256
source
distributed log aggregation

high-throughput log ingestion system handling 50k+ logs/sec using go concurrency and kafka.

gokafkapostgresqldocker
enterprise ai workflows

end-to-end workflow system integrating azure openai and ai search for enterprise-grade automation.

pythonfastapiazure openaireact
volatility surface modelling

generative models (gan/vae) to produce smooth volatility surfaces for option pricing.

pythonganvaequantlib
pegasus transformer fine-tuning

fine-tuned pegasus on aeslc for abstractive summarization in low-resource email domains.

pytorchtransformersnlp
supply chain optimization

optimal path algorithms using dijkstra's and custom regression models for yield prediction.

c++python
seq2seq summarization

sequence-to-sequence model using encoder-decoder transformers for document summarization.

pythonseq2seqtransformers

research

05
memory isolation in multi-agent llm systems

formulated memory interference as a failure mode in llm-based multi-agent systems, designing architectural variants for controlled retrieval-scoping.

independent research2025
llmfuzz-bench

designed a controlled evaluation framework to quantify output stability of llms under stochastic decoding conditions across 1,500+ tasks.

independent research2025
recommender systems & diversity

researching methods to improve diversity, relevance, and ranking stability in recommendation pipelines using llm-driven approaches.

bennett university2024 – ongoing
emotion recognition with cnns & transformers

benchmarked cnn and transformer architectures for emotion recognition on fer2013 dataset, analysing accuracy-efficiency tradeoffs.

computer vision2024
matrix factorization enhancements

improved ndcg/mrr metrics on matrix factorization models through novel regularization and training strategies.

collaborative filtering2024

blog

02
11 feb 2026
your brain already solved the problem ai agents need to work on

multi-agent memory architecture mirrors 500 million years of neural evolution. your brain doesn't blend kitchen and living room memories into "memory soup" — multi-agent ai systems are converging on the same solution.

multi-agent memory
read
12 jan 2026
should models always finish their sentences: the case for explicit hesitation

the industry has spent three years optimizing for fluency, creating the world's most confident liars. hallucination is a structural byproduct of our objective functions, not a bug to patch with more data.

llmhesitation
read

skills

c++pythongojavascalabashsqldockergitazure devopsci/cdkafkafastapiflaskpostgresqlmysqlpytorchtensorflowkerastransformershugging facelangchainllamascikit-learnnumpypandasopencvpytestlinuxjupyter