-
Microsoft
- Bangalore, India
Lists (4)
Sort Name ascending (A-Z)
Stars
An agentic skills framework & software development methodology that works.
Research repository for AI sandbagging experimentation at Algoverse
Fully local web research and report writing assistant
A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI
SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
Implement code of paper "AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Agriculture mapping"
Python package and backend for the Elysia platform app.
[ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.
Task-Aware Agent-driven Prompt Optimization Framework
Kimi K2 is the large language model series developed by Moonshot AI team
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Roo Code gives you a whole dev team of AI agents in your code editor.
Appwrite® - complete cloud infrastructure for your web, mobile and AI apps. Including Auth, Databases, Storage, Functions, Messaging, Hosting, Realtime and more
Starter Kit for Participants of PairWise Alpha Challenge on https://app.lunor.quest
Tool for generating high quality Synthetic datasets
Easily convert tool, agents and orchestrators from existing agent frameworks to MCP servers
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
Prompt, run, edit, and deploy full-stack web applications. -- bolt.new -- Help Center: https://support.bolt.new/ -- Community Support: https://discord.com/invite/stackblitz
DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess finan…
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.