Research Engineer @ Adobe · LLM Agent Evals & Benchmarking · AI Security

SDR-Bench · Memento · CSAW ESC '22 winner · WACV & ECCV · Rust systems · IIT Roorkee

// What I work on:
const research = {
  agents:   ["LLM evals", "agent benchmarking", "RL fine-tuning"],
  security: ["adversarial ML", "CTF", "Web3 security"],
  systems:  ["Rust", "OS kernels", "GPU clusters"],
  infra:    ["K8s", "vector DBs", "distributed systems"],
  thesis:   "measure what agents can't do yet"
};

~/publications

Research in AI/ML and computer vision with collaborators from top institutions

4 publications across WACV, ECCV, and BiTW • Collaborations with Adobe Research, Stanford, Microsoft Research, CMU, and premier IITs

Under Review · 2026

MEMENTO: Leveraging Web as a Learning Signal for Low-Data Domains

Ashutosh Ojha, Vinay Aggarwal, Ashutosh Srivastava, Siddharth Yedlapati, Yaman K Singla, Jitendra Ajmera
Adobe
Under Review
Under Review · 2026

SDR-Bench: Benchmarking the Personalization Capabilities of Large Language Models

Ashutosh Srivastava, Siddharth Yedlapati, Vinay Aggarwal, Shashwat Dixit, Yaman Kumar Singla
Adobe
Under Review
WACV 2025

ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models

Ashutosh Srivastava, Tarun Ram Menta, Abhinav Java, Avadhoot Jadhav, Silky Singh, Surgan Jandial, Balaji Krishnamurthy
IIT Roorkee · Adobe Research · Microsoft Research · IIT Bombay · Stanford University · Carnegie Mellon University
Winter Conference on Applications of Computer Vision (WACV) & ECCV 2024 Workshop (AI4VA)
ECCV 2024W

Towards Efficient Exemplar Based Image Editing with Multimodal VLMs

Avadhoot Jadhav, Ashutosh Srivastava, Abhinav Java, Silky Singh, Tarun Ram Menta, Surgan Jandial, Balaji Krishnamurthy
IIT Bombay · IIT Roorkee · Microsoft Research · Stanford University · Adobe Research · Carnegie Mellon University
European Conference on Computer Vision Workshop (ECCV 2024 - AI4VA)

~/achievements

Recognition for contributions to AI research, blockchain innovation, and cybersecurity competitions

~/experience

Applied Research Engineer

Adobe
Jul 2025 - Present

Working on the Sales Qualifier Agent (AJO B2B AO) — an AI-driven application that automates B2B prospect qualification and outreach. Leading personalization research including SDR-Bench, the first benchmark for measuring personalization capabilities of Deep Research agents for B2B sales.

Research Intern

Trinity College Dublin
Dec 2024 - Mar 2025

Built RadBot — an automated brain tumor segmentation and survivability prediction system. Applied multimodal AI to medical imaging at the intersection of ML and clinical neuroscience.

Infrastructure Engineer

Abacus.AI
Oct 2024 - Feb 2025

Managed Kubernetes clusters and microservices across AWS and GCP. Built event-driven architectures with Kafka pipelines, Redis caching, and MySQL binlog processing for real-time data sync.

Research Intern

Adobe
May 2024 - Jul 2024

Developed ReEdit — a novel end-to-end framework for exemplar-based image editing using diffusion models and VLMs. Published at WACV 2025 and ECCV 2024 AI4VA workshop.

Team Captain

InfoSecIITR
Jun 2022 - May 2025

Led cybersecurity initiatives and CTF competitions (ranked #40 globally and #4 in India on CTFtime). Won the Research Track of CSAW ESC 2022 with work on adversarial attacks against ML models. Mentored team members and developed security tools and frameworks. Visit: infoseciitr.in

Developer

SDSLabs
Apr 2022 - May 2025

Contributed to open-source projects at SDSLabs, IIT Roorkee. Developed VectorDB, Katana, and RusticOS, and participated in multiple hackathons. Visit: sdslabs.co

~/expertise

Multi-domain technical expertise across cutting-edge technologies

Agentic AI & Evaluation

End-to-end architecture of multi-agent LLM systems. Pioneer of SDR-Bench and Memento benchmarks. Built RL fine-tuning pipelines on 10+8 x A100 GPU clusters with 1.5-3x inference speedups.

PyTorch · RL · LLMs · Benchmarking

Multimodal & Vision-Language

ReEdit (WACV/ECCV) for exemplar-based image editing with diffusion models. RadBot for brain tumor segmentation. Knowledge graphs via internet-scale scraping for LLM traversal.

Diffusion · VLMs · Knowledge Graphs

AI & Systems Security

CSAW ESC 1st place for adversarial ML. NEAR REDACTED 2024 CTF winner. Team Captain of InfoSecIITR (#4 India, #40 globally). GIAC/SANS certified.

Adversarial ML · CTF · Web3 Security

Infrastructure & Distributed Systems

K8s microservices at Abacus.AI across AWS/GCP. Kafka/Redis/MySQL binlog pipelines. GPU orchestration for distributed RL training.

Kubernetes · Kafka · GPU Clusters

Low-Level Systems

RusticOS — modular kernel in Rust with ring 0/3 separation and custom syscall interface. VortexDB — vector database engine with HNSW indexing and DistilBERT vectorizer.

Rust · OS Kernels · Vector DBs

Blockchain & Web3

EigenLayer Infinite Prize winner (ValidAI AVS). Proof of Optima using RISC-0 zkVM and Solidity. MCP servers and A2A frameworks for production SSE serving.

Solidity · zkVM · AVS

~/projects

Open-source contributions and personal projects across multiple domains

SDR-Bench

AI/ML · Featured

The first framework to systematically benchmark generative personalization capabilities of LLMs for B2B sales. Features a dual-layered dataset spanning 6,279 articles across 20+ industries.

Python · LLMs · NLP

RusticOS

OS Development

Modular operating system kernel written completely in Rust. Features custom memory management, process scheduling, and x86-64 architecture support.

Rust · OS · Systems

VortexDB

AI/ML

High-performance vector database core engine built from scratch in Rust. Features HNSW vector indices, DistilBERT image vectorizer flows, and efficient similarity search for ML applications.

Rust · ML Infra · HNSW

RadBot

AI/ML

Automated brain tumor segmentation and survivability prediction system. Applies multimodal AI to clinical neuroimaging for accurate diagnosis support.

Python · PyTorch · Medical AI

Katana CTF Platform

Security

Highly available K8s attack-defense CTF platform with strict namespace isolation, automated MongoDB/MySQL provisioning, and health-check cronjobs.

Go · Kubernetes · Docker

ValidAI

Blockchain · Featured
EigenLayer Infinite Prize

Decentralized AI validation system leveraging Actively Validated Services (AVS). Implements custom consensus for ML model verification on-chain. Winner of EigenLayer Infinite Prize.

Solidity · EigenLayer · AVS

Proof of Optima

Blockchain

Zero-knowledge proof system for verifiable computation, built on the RISC-0 zkVM. Demonstrates advanced cryptographic protocols for smart contracts.

RISC-0 · zkVM · Solidity

~/skills

Technical expertise across multiple domains

AI & Machine Learning

PyTorch · Diffusion Models · Transformers · LLMs · RL Fine-tuning · Computer Vision · NLP · MLOps · Agentic AI · Knowledge Graphs

Security & CTF

Adversarial ML · Web AppSec · Penetration Testing · CTFs · Web3 Security

Infrastructure & DevOps

Kubernetes · Docker · AWS · GCP · Kafka · Redis · Terraform · CI/CD · GPU Orchestration

Blockchain & Web3

Smart Contracts · Solidity · zkVM · RISC-0 · AVS · EigenLayer

Languages & Tools

Python · Rust · Go · C/C++ · TypeScript · SQL · Git · Linux · Azure Foundry · MCP Servers