Gang Liao

Gang Liao · 2025-01-08T19:50:11.787Z

https://lnkd.in/gA92cBSR spotlights the key challenges columnar formats face in modern ML—from handling wide, sparse datasets to tackling data compliance and (embedded) vector support. It’s time for new storage formats tailor-made for today’s AI and ML demands. Check out the paper to learn more!

Mountain View, California, United States
1K followers 500+ connections

View mutual connections with Gang

Gang can introduce you to 10+ people at Meta

Email or phone

Password

Forgot password?

or

New to LinkedIn? Join now

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Join to view profile

About

I build the systems that make ML work at scale — from silicon to software. At Meta…

Activity

1K followers

Gang Liao

Gang Liao

2mo
Report this post
Gang Liao shared this
Our paper, KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta, will appear at ISCA 2026. KernelEvolve generates and optimizes GPU kernels, delivering 25-60%+ QPS gains on ad production models across NVIDIA/AMD/MTIA in hours instead of weeks. Technical deep dive on the Meta Engineering Blog: https://lnkd.in/eNtixwiM P.S. If you're a PhD student interested in a 2027 summer internship working on agentic AI systems and kernel optimization, feel free to reach out! Meta International Symposium on Computer Architecture (ISCA)

public_profile__posts
Gang Liao reposted this
Report this post
Gang Liao reposted this

Matt Steiner

Matt Steiner

2mo

Gang Liao reposted this
Excited to share Ranking Engineer Agent's KernelEvolve — an agentic AI system that autonomously writes and optimizes the low-level kernels powering Meta's AI infrastructure. Meta's diverse hardware fleet (NVIDIA/AMD GPUs, custom MTIA chips, CPUs) requires tuned kernels for every model-hardware combination. Manual expert tuning doesn't scale. Results: 60%+ inference throughput improvement for the Andromeda Ads model on NVIDIA GPUs 25%+ training throughput improvement for an ads model on MTIA silicon Weeks of expert tuning compressed into hours of automated search Runs across NVIDIA, AMD, MTIA, and CPU — generating kernels in Triton, CUDA, HIP, and more Kernel development is no longer a manual bottleneck — it's continuous and automated. Our paper, KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta, will appear at ISCA 2026. Technical deep dive on the Meta Engineering Blog at https://lnkd.in/eNtixwiM. Congratulations to the team! 🙌

KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure

KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure
1 Comment
Gang Liao

Gang Liao

5mo
Report this post
Gang Liao shared this
Excited to share our recent work on KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta. We designed, implemented, and deployed KernelEvolve to optimize a wide variety of production recommendation models across generations of NVIDIA and AMD GPUs, as well as Meta’s latest-generation AI accelerators (MTIA v3). Writing high-performance GPU kernels is a complex challenge that typically demands years of deep expertise and remains a major focus of industry and academic research. It’s truly impressive to see KernelEvolve not only achieve state-of-the-art results on open benchmarks, but also deliver 1.25–17x speedups across Meta production use cases. This milestone was made possible by outstanding collaboration across Meta—including teams from Monetization Infra and Ranking, FAIR, Compiler, MTIA, Serverless Compute, and more. Thank you to everyone for your dedication and teamwork in making this breakthrough happen! You can read the full paper here: 👉 https://lnkd.in/gdPb43EZ This is only ~1% of the journey. There is much more ahead in 2026 as we continue pushing the boundaries. If your background aligns (Agentic, LLM, RL, AI compiler, Kernels, Inference/training optimization, Data Infra for prompts/context etc.) and you’re interested in joining us on this journey, feel free to DM me. We’re hiring.

public_profile__posts
13 Comments
Gang Liao

Gang Liao

5mo
Report this post
Gang Liao shared this
Excited to share our recent work on KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta.
Gang Liao

Gang Liao

1y
Report this post
Gang Liao shared this
https://lnkd.in/gA92cBSR spotlights the key challenges columnar formats face in modern ML—from handling wide, sparse datasets to tackling data compliance and (embedded) vector support. It’s time for new storage formats tailor-made for today’s AI and ML demands. Check out the paper to learn more!

Starburst

Starburst

1y

Gang Liao shared this
Are Parquet and ORC holding back your ML workloads? In his latest article, Daniel Abadi, Computer Science Professor at the University of Maryland, highlights key challenges these columnar formats face with modern ML workloads, from handling wide, sparse datasets to data compliance and vector support. It’s time for new formats tailored for today’s AI and ML demands. Learn more: https://okt.to/TM39Ge #MachineLearning #BigData #AI

Parquet and ORC’s many shortfalls for machine learning (ML) workloads, and what should be done about them

Parquet and ORC’s many shortfalls for machine learning (ML) workloads, and what should be done about them

Gang Liao liked this
Report this post
Gang Liao liked this

Amin Vahdat

Amin Vahdat

18h

Gang Liao liked this
Graduation and intern season is easily one of my favorite times of the year at Google. There’s an undeniable energy that comes with a new cohort of people arriving ready to question things, learn, and build. Hearing Sundar Pichai speak at Stanford University's commencement recently reminded me of how essential it is to ground ourselves in what truly matters when navigating a rapidly changing world. As a former professor and a proud father of a recent college grad, I decided to share some of the important lessons that have informed my own journey: Pick a great team then and foster critical skills. People matter much more than projects. The right team and the right mentors will teach you the discipline and rigor required to bring structure to highly ambiguous problems. This capacity for articulating intricate design decisions and trade-offs is a formidable skill that will serve you indefinitely. Further, always stay "in school" to maintain the intellectual curiosity necessary to continually broaden your knowledge. Invest in people first. Bet on people. Don't hesitate to seek support when necessary, but prioritize contributing more than you take. True leadership isn't about barking orders from the lead; it involves mastering how to follow at both the start and end, guiding others from the rear and stepping in only when necessary. The relationships you establish and the growth you foster will mean much more than any individual milestone. Persevere. How you handle adversity is much more important than how you handle success. Expect significant challenges as part of your growth. The challenges that seemed insurmountable a few years ago will become routine, but only through perseverance. Reputation is repetition. Say what you mean, and mean what you say. It’s the consistency in your actions and principles that builds your reputation. Authenticity and integrity are everything in engineering and leadership. Build your career on intrinsic motivation, follow what genuinely excites you, and stay anchored to your core values. Kindness is the ultimate nobility. In the realm of building world-scale systems and leading brilliant minds, true influence is not measured solely or even mostly by technical milestones or architectural breakthroughs, but by the empathy and respect we extend to those around us. To all new graduates, incoming interns and anyone stepping into a new chapter, wishing you endless success and perseverance.
13 Comments
Gang Liao liked this
Report this post
Gang Liao liked this

Tianqi Chen

Tianqi Chen

21h

Gang Liao liked this
One of the most fun parts of working in AI systems is having to relearn GPU programming every few years. Blackwell is a completely different beast from what we are used to, yet most existing courses still stoped at GPU programming a decade ago while modern kernel optimizations have long moved past that. This year in our CMU ML Systems course, we decided to change that. We took a crash-course approach to teaching modern (Blackwell) GPU programming for the very first time. Along the way, a lot of fun questions popped up: - What is data layout? - What is data swizzling? - How do you use 3D TMA for one-shot tiling and swizzling? - What does it actually take to write high-performance kernels today? Huge thanks to our amazing course staff and to Modal for the compute support. Together, we finished the first iteration as part of a mini-lecture series in the ML systems course , packed with interactive materials to answer those exact questions. Now, we’ve polished those materials and turned them into a free online book: "Modern GPU Programming for MLSys". We released the book along with a minimal compiler (checkout Bohan Hou's post) that supports many hands-on examples for developers and agents. Check it out https://lnkd.in/gzFQ-CDa

public_profile__reactions
4 Comments
Gang Liao liked this
Report this post
Gang Liao liked this

Waleed Atallah

Waleed Atallah

1d

Gang Liao liked this
Today we are open-sourcing 600,000+ Trition kernels as part of a collaboration with researchers at Google and Cornell University. The dataset includes full evaluation results and is designed to provide a representative distribution of LLM-generated GPU kernels. Its available for free on Hugging Face. Pushing the frontier of GPU code generation is a rising tide that lifts all boats in the AI race, with an outsized impact on open source projects! Dataset: https://lnkd.in/eNhE9bS8 Blog: https://lnkd.in/edbMpQrQ

Open-sourcing 600,000 Triton kernels via Hugging Face — Makora

Open-sourcing 600,000 Triton kernels via Hugging Face — Makora
5 Comments
Gang Liao liked this
Report this post
Tianqi Chen

Tianqi Chen

1d

Gang Liao liked this
What will the role of AI compilers be in the age of AI agents and frontier kernel programming? We believe agents should have access to a predictable DSL that offers maximum expressiveness, paired with a minimal compiler they can directly open up, build toolings, and improve for specialized optimizations. TIRx is our effort on this front. We've had a great experience using it in our latest mega-kernel compiler research and teaching Blackwell programming in our ML systems course at CMU. Check it out

Bohan Hou

Bohan Hou

1d

Gang Liao liked this
We release TIRx today, a minimal compiler stack and hardware-native DSL for frontier ML kernels, built around storage-first tensor layouts and reusable tile primitives. Blog post: https://lnkd.in/da86cx8Z On NVIDIA B200, TIRx delivers up to ~1.08× over cuBLASLt on dense GEMM, outperforms DeepGEMM on all FP8 blockwise workloads with up to ~1.09× speedup, keeps FA4 typically within ~±2% of CuTeDSL, and remains competitive with cuBLASLt/FlashInfer on NVFP4 GEMM. The release also comes with the accompanying Modern GPU Programming for ML Systems course/book, which explains the hardware and programming concepts behind the kernels. Through our past experiences building frontier ML kernels, megakernels, and agentic kernel systems, we kept seeing the same boundary problem: new operators and new hardware require new optimization strategies that often break existing programming models or compiler-pass boundaries. This led us to a simple goal: users and agents should always be able to express the best-performing program, even for future hardware generations, while keeping the engineering effort for writing new kernels and extending to new hardware as low as possible. TIRx builds on top of Apache TVM and moves toward this goal by drawing a lower and more stable compiler boundary. Instead of hiding hardware-native orchestration too early, TIRx keeps key kernel details explicit in source code: memory placement, synchronization, pipeline state, role assignment, backend intrinsics, and layout choices. At the same time, it makes recurring tile-level patterns reusable and compiler-visible through storage-first tensor layouts, execution scopes, and tile primitive dispatch. The design is intentionally minimal. New hardware features can first be exposed as intrinsics. Recurring patterns can later become reusable primitives. Higher-level automation, search, and agentic optimization can then be built on top without blocking expert kernel authors from expressing the program they actually want. With this release, we hope TIRx can serve as a stable substrate for the next layer of kernel work: expert-written frontier kernels, composable megakernel systems, and agentic kernel optimization built on a compiler-visible native IR.

TIRx: An Open Compiler Stack for Evolving Frontier ML Kernels

TIRx: An Open Compiler Stack for Evolving Frontier ML Kernels
6 Comments
Gang Liao liked this
Report this post
Gang Liao liked this

Reflection

Reflection

1d

Gang Liao liked this
Reflection has signed a compute agreement with SpaceXAI, securing additional capacity at Colossus 2. More compute gives us more room to push the frontier on open models. We're hiring researchers, engineers, and builders across the company. See our open roles to join us: https://lnkd.in/gSTWvqA5 https://lnkd.in/gtvxu4vK

Careers

Careers
3 Comments
Gang Liao liked this
Report this post
Gang Liao liked this

Kuldeep Singh Sidhu

Kuldeep Singh Sidhu

3w

Gang Liao liked this
Building Enterprise Search Agents That Learn Their Own Search Strategies The Databricks AI Research team just published KARL, a sophisticated approach to training knowledge agents through reinforcement learning that achieves state-of-the-art performance on diverse enterprise search tasks. What makes this work stand out technically: The system uses a two-stage agentic synthesis pipeline. During data generation, agents dynamically explore document corpora using vector search, creating grounded question-answer pairs. The key insight: by training on increasingly capable models through iterative bootstrapping, the synthesis quality itself improves in subsequent training rounds. For post-training, KARL implements OAPL- an iterative large-batch off-policy RL approach that sidesteps the instability issues plaguing online RL methods at scale. By embracing off-policy design in the objective function itself, the method avoids common heuristics like importance weighting clipping or router replay, significantly reducing infrastructure complexity. The context management is particularly elegant. Rather than using a separate compression model, the agent learns to compress its own history end-to-end during RL training, optimizing for task rewards. When accumulated context exceeds thresholds during long search trajectories, the model summarizes its findings in-place before continuing. At inference time, the system employs two complementary test-time compute strategies: Parallel Thinking generates multiple independent rollouts that an aggregator agent synthesizes into a unified response, while Value-Guided Search trains a value model to predict success probability at any token position, steering a breadth-first tree search toward high-confidence branches. The framework, evaluated across six distinct search regimes spanning constraint-driven entity search to tabular reasoning, demonstrates that training across heterogeneous tasks produces substantially better out-of-distribution generalization than single-task optimization.

public_profile__reactions
3 Comments
Gang Liao liked this
Report this post
Gang Liao liked this

Sakana AI

Sakana AI

2mo

Gang Liao liked this
We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system! Blog: https://lnkd.in/gVXVa-VN Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task. Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning Apply for the beta test here: https://lnkd.in/g4Wm8t-a

public_profile__reactions
4 Comments
Gang Liao liked this
Report this post
Gang Liao liked this

ACM International Conference on Supercomputing 2026

ACM International Conference on Supercomputing 2026

6d

Gang Liao liked this
💡 ICS 2026 Keynote Spotlight: Dr Carole-Jean Wu 💡 “Scaling AI Computing Sustainably: A Journey Towards Sustainable AI” At ACM ICS 2026, we are delighted to welcome Dr Carole-Jean Wu, Director of AI Research at Meta, for the first keynote of the conference. This keynote will explore efficiency and sustainability opportunities across the AI lifecycle — from model development to infrastructure, datacenter operations, and hardware lifecycle impacts. 📅 7 July 2026 📍 Belfast, Northern Ireland, UK 🔗 Full abstract and bio: https://lnkd.in/eV8ZWVKQ Join us for the 40th edition of ACM ICS! #ICS2026 #ACM #Supercomputing #HPC #SustainableAI #SIGHPC #SIGARCH

public_profile__reactions
1 Comment
Gang Liao liked this
Report this post
Gang Liao liked this

Vivek Raghunathan

Vivek Raghunathan

6d

Gang Liao liked this
➡️Adoption is just the beginning.⬅️ We've watched engineers use the exact same AI tools and get completely different results. The gap isn't the model — it's the workflow. At Snowflake, we've identified 14 design patterns that consistently show up in our highest-impact engineers. They emerged from a group of fearless explorers in our org who pushed the tools further than anyone expected. These patterns are the difference between "AI saves me 20 minutes a day" and "AI handles 80% of my job." Same tool. Different patterns. Getting your team to adopt AI is the easy part. The real leadership challenge is helping them find — and scale — the workflows that actually move the needle. #SnowflakeSummit #AI #DeveloperProductivity #Engineering

public_profile__reactions
5 Comments

See all activities

Experience

Meta

Sunnyvale, CA
-

San Jose, California, United States
-

Menlo Park, California, United States
-

Redmond, Washington, United States
-

Bellevue, Washington, United States
-
-
-

Education

University of Maryland

Advisor: Daniel Abadi (https://en.wikipedia.org/wiki/Daniel_Abadi)

Ph.D. Thesis: The Evolution of Cloud Data Architectures: Storage, Compute, and Migration
https://drum.lib.umd.edu/handle/1903/29199
-

Publications

KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

ISCA'26 December 29, 2025
Making deep learning recommendation model (DLRM) training and inference fast and efficient is important. However, this presents three key system challenges - model architecture diversity, kernel primitive diversity, and hardware generation and architecture heterogeneity. This paper presents KernelEvolve-an agentic kernel coding framework-to tackle heterogeneity at-scale for DLRM. KernelEvolve is designed to take kernel specifications as input and automate the process of kernel generation and…

Making deep learning recommendation model (DLRM) training and inference fast and efficient is important. However, this presents three key system challenges - model architecture diversity, kernel primitive diversity, and hardware generation and architecture heterogeneity. This paper presents KernelEvolve-an agentic kernel coding framework-to tackle heterogeneity at-scale for DLRM. KernelEvolve is designed to take kernel specifications as input and automate the process of kernel generation and optimization for recommendation model across heterogeneous hardware architectures. KernelEvolve does so by operating at multiple programming abstractions, from Triton and CuTe DSL to low-level hardware agnostic languages, spanning the full hardware-software optimization stack. The kernel optimization process is described as graph-based search with selection policy, universal operator, fitness function, and termination rule, dynamically adapts to runtime execution context through retrieval-augmented prompt synthesis. We designed, implemented, and deployed KernelEvolve to optimize a wide variety of production recommendation models across generations of NVIDIA and AMD GPUs, as well as Meta's AI accelerators. We validate KernelEvolve on the publicly-available KernelBench suite, achieving 100% pass rate on all 250 problems across three difficulty levels, and 160 PyTorch ATen operators across three heterogeneous hardware platforms, demonstrating 100% correctness. KernelEvolve reduces development time from weeks to hours and achieves substantial performance improvements over PyTorch baselines across diverse production use cases and for heterogeneous AI systems at-scale. Beyond performance efficiency improvements, KernelEvolve significantly mitigates the programmability barrier for new AI hardware by enabling automated kernel generation for in-house developed AI hardware.

Other authors
See publication
SFVInt: Simple, Fast and Generic Variable-Length Integer Decoding using Bit Manipulation Instructions

20th International Workshop on Data Management on New Hardware (DaMoN) April 28, 2024
The ubiquity of variable-length integers in data storage and communication necessitates efficient decoding techniques. In this paper, we present SFVInt, a simple and fast approach to decode the prevalent Little Endian Base-128 (LEB128) varints. Our approach, distilled into a mere 500 lines of code, effectively utilizes the Bit Manipulation Instruction Set 2 (BMI2) in modern Intel and AMD processors, achieving significant performance improvement while maintaining simplicity and avoiding…

The ubiquity of variable-length integers in data storage and communication necessitates efficient decoding techniques. In this paper, we present SFVInt, a simple and fast approach to decode the prevalent Little Endian Base-128 (LEB128) varints. Our approach, distilled into a mere 500 lines of code, effectively utilizes the Bit Manipulation Instruction Set 2 (BMI2) in modern Intel and AMD processors, achieving significant performance improvement while maintaining simplicity and avoiding overengineering. SFVInt, with its generic design, effectively processes both 32-bit and 64-bit unsigned integers using a unified code template, marking a significant leap forward in varint decoding efficiency. We thoroughly evaluate SFVInt's performance across various datasets and scenarios, demonstrating that it achieves up to a 2x increase in decoding speed when compared to varint decoding methods used in established frameworks like Facebook Folly and Google Protobuf.

Other authors
See publication
Bullion: A Column Store for Machine Learning

15th Conference on Innovative Data Systems Research (CIDR), 2025 April 13, 2024
The past two decades have witnessed columnar storage revolutionizing data warehousing and analytics. However, the rapid growth of machine learning poses new challenges to this domain. This paper presents Bullion, a columnar storage system tailored for machine learning workloads. Bullion addresses the complexities of data compliance, optimizes the encoding of long sequence sparse features, efficiently manages wide-table projections, and introduces feature quantization in storage. By aligning…

The past two decades have witnessed columnar storage revolutionizing data warehousing and analytics. However, the rapid growth of machine learning poses new challenges to this domain. This paper presents Bullion, a columnar storage system tailored for machine learning workloads. Bullion addresses the complexities of data compliance, optimizes the encoding of long sequence sparse features, efficiently manages wide-table projections, and introduces feature quantization in storage. By aligning with the evolving requirements of ML applications, Bullion extends columnar storage to various scenarios, from advertising and recommendation systems to the expanding realm of Generative AI.

Preliminary experimental results and theoretical analysis demonstrate Bullion’s superior performance in handling the unique demands of machine learning workloads compared to existing columnar storage solutions. Bullion significantly reduces I/O costs for deletion compliance, achieves substantial storage savings with its optimized encoding scheme for sparse features, and drastically improves metadata parsing speed for wide-table projections. These advancements position Bullion as a critical component in the future of machine learning infrastructure, enabling organizations to efficiently manage and process the massive volumes of data required for training and inference in modern AI applications.

Other authors
See publication
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms

December 27, 2023
In this paper, we present Flock, a cloud-native streaming query engine that leverages the on-demand elasticity of Function-as-a-Service (FaaS) platforms to perform real-time data analytics. Traditional server-centric deployments often suffer from resource under- or over-provisioning, leading to resource wastage or performance degradation. Flock addresses these issues by providing more fine-grained elasticity that can dynamically match the per-query basis with continuous scaling, and its billing…

In this paper, we present Flock, a cloud-native streaming query engine that leverages the on-demand elasticity of Function-as-a-Service (FaaS) platforms to perform real-time data analytics. Traditional server-centric deployments often suffer from resource under- or over-provisioning, leading to resource wastage or performance degradation. Flock addresses these issues by providing more fine-grained elasticity that can dynamically match the per-query basis with continuous scaling, and its billing methods are more fine-grained with millisecond granularity, making it a low-cost solution for stream processing. Our approach, payload invocation, eliminates the need for external storage services and eliminates the requirement for a query coordinator in the data architecture. Our evaluation shows that Flock significantly outperforms state-of-the-art systems in terms of cost, especially on ARM processors, making it a promising solution for real-time data analytics on FaaS platforms.

Other authors
See publication
FileScale: Fast and Elastic Metadata Management for Distributed File Systems

Proceedings of the 14th ACM Symposium on Cloud Computing (SoCC) September 1, 2023
Recent work has shown that distributed database systems are a promising solution for scaling metadata management in scalable file systems. This work has shown that systems that store metadata on a single machine, or over a shared-disk abstraction, struggle to scale performance to deployments including billions of files. In contrast, leveraging a scalable, shared-nothing, distributed system for metadata storage can achieve much higher levels of scalabil- ity, without giving up high availability…

Recent work has shown that distributed database systems are a promising solution for scaling metadata management in scalable file systems. This work has shown that systems that store metadata on a single machine, or over a shared-disk abstraction, struggle to scale performance to deployments including billions of files. In contrast, leveraging a scalable, shared-nothing, distributed system for metadata storage can achieve much higher levels of scalabil- ity, without giving up high availability guarantees. However, for low-scale deployments – where metadata can fit in memory on a single machine – these systems that store metadata in a distributed database typically perform an order of magnitude worse than systems that store metadata in memory on a single machine. This has limited the impact of these distributed database approaches, since they are only currently applicable to file systems of extreme scale.
This paper describes FileScale, a three-tier architecture that incorporates a distributed database system as part of a comprehen- sive approach to metadata management in distributed file systems. In contrast to previous approaches, the architecture described in the paper performs comparably to the single-machine architecture at a small scale, while enabling linear scalability as the file system metadata increases.

Other authors
See publication
BullFrog: Online Schema Evolution via Lazy Evaluation

Proceedings of the 2021 International Conference on Management of Data (SIGMOD 2021) June 20, 2021
BullFrog is a relational DBMS that supports single-step schema migrations --- even those that are backwards incompatible --- without downtime, and without need for advanced warning. When a schema migration is submitted, BullFrog initiates a logical switch to the new schema, but physically migrates affected data lazily, as it is accessed by incoming transactions. BullFrog's internal concurrency control algorithms and data structures enable concurrent processing of schema migration operations…

BullFrog is a relational DBMS that supports single-step schema migrations --- even those that are backwards incompatible --- without downtime, and without need for advanced warning. When a schema migration is submitted, BullFrog initiates a logical switch to the new schema, but physically migrates affected data lazily, as it is accessed by incoming transactions. BullFrog's internal concurrency control algorithms and data structures enable concurrent processing of schema migration operations with post-migration transactions while ensuring exactly-once migration of all old data into the physical layout required by the new schema. BullFrog is implemented as an open source extension to PostgreSQL. Experiments using this prototype over a TPC-C based workload (supplemented to include schema migrations) show that BullFrog can achieve zero-downtime migration to non-trivial new schemas with near-invisible impact on transaction throughput and latency.

Other authors
See publication

View Gang’s full profile

See who you know in common
Get introduced
Contact Gang directly

Join to view full profile

Other similar profiles

Xiangyi Chen

Xiangyi Chen

Mountain View, CA

Connect
Charuta Pethe

Charuta Pethe

Sunnyvale, CA

Connect
Sopan Khosla

Sopan Khosla

San Francisco Bay Area

Connect
Ashwini Badgujar

Ashwini Badgujar

San Francisco, CA

Connect
Mansi Khemka

Mansi Khemka

United States

Connect
Vishal Lal

Vishal Lal

Los Angeles, CA

Connect
Harit Vishwakarma

Harit Vishwakarma

Oxford

Connect
Setu Shah

Setu Shah

Seattle, WA

Connect
Kai Rawal

Kai Rawal

London

Connect
Hao Liu

Hao Liu

United States

Connect
Vatsal Sodha

Vatsal Sodha

Dallas-Fort Worth Metroplex

Connect
Omid Poursaeed

Omid Poursaeed

New York, NY

Connect
Gagan Somashekar

Gagan Somashekar

Denver, CO

Connect
Medha Sagar

Medha Sagar

Los Angeles Metropolitan Area

Connect
Ruoyu Li

Ruoyu Li

Bellevue, WA

Connect
Xikun Zhang

Xikun Zhang

San Francisco, CA

Connect
Aniruddha Tapas

Aniruddha Tapas

Vancouver, BC

Connect
Hanhan Zhou

Hanhan Zhou

Santa Clara, CA

Connect
Sangeeta Chowdhary

Sangeeta Chowdhary

Sunnyvale, CA

Connect
Rahul Mittal

Rahul Mittal

Austin, TX

Connect

Explore more posts

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Others named Gang Liao

188 others named Gang Liao are on LinkedIn

See others named Gang Liao

Add new skills with these courses

See all courses

Gang Liao

Mountain View, California, United States 1K followers 500+ connections

About

Activity

1K followers

Gang Liao

Matt Steiner

Gang Liao

Gang Liao

Gang Liao

Starburst

Amin Vahdat

Tianqi Chen

Waleed Atallah

Tianqi Chen

Bohan Hou

Reflection

Kuldeep Singh Sidhu

Sakana AI

ACM International Conference on Supercomputing 2026

Vivek Raghunathan

Experience

-

-

-

-

-

-

-

Education

-

Publications

ISCA'26 December 29, 2025

20th International Workshop on Data Management on New Hardware (DaMoN) April 28, 2024

15th Conference on Innovative Data Systems Research (CIDR), 2025 April 13, 2024

December 27, 2023

Proceedings of the 14th ACM Symposium on Cloud Computing (SoCC) September 1, 2023

Proceedings of the 2021 International Conference on Management of Data (SIGMOD 2021) June 20, 2021

View Gang’s full profile

Other similar profiles

Xiangyi Chen

Charuta Pethe

Sopan Khosla

Ashwini Badgujar

Mansi Khemka

Vishal Lal

Harit Vishwakarma

Setu Shah

Kai Rawal

Hao Liu

Vatsal Sodha

Omid Poursaeed

Gagan Somashekar

Medha Sagar

Ruoyu Li

Xikun Zhang

Aniruddha Tapas

Hanhan Zhou

Sangeeta Chowdhary

Rahul Mittal

Explore more posts

Explore top content on LinkedIn

Others named Gang Liao

GANG LIAO

Gang (Jerry) Liao

gang liao

Hong-Gang Liao

Add new skills with these courses

Local AI: Build a RAG Model from Scratch with Open-Source Tools

Create Your Own Code Assistant with Llama 2, Node.js, and React.js

Building a RAG Solution from Scratch

Mountain View, California, United States
1K followers 500+ connections