Documentation

FlagOS Core Libraries

FlagGems

A high-performance, general-purpose operator library implemented in the Triton programming language and its language extensions.

FlagTree

An open-source, unified compiler for multiple AI chips.

FlagScale

A comprehensive toolkit designed to support the entire lifecycle of large models.

FlagCX

A scalable and adaptive unified communication library for cross-chip environments.

Fused Operator Libraries

FlagGems-vllm

A high-performance deep learning operator library.

Multi-Domain Operator Libraries

FlagDNN

A deep neural network computing library targeting multiple chip backends.

FlagBLAS

A computing library that follows the BLAS standard interface.
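
For readers unfamiliar with the BLAS interface, the sketch below shows the semantics of GEMM, the standard BLAS level-3 routine that computes C ← αAB + βC. It is a plain-Python reference for the operation only: the function name `gemm` and the nested-list matrix layout are assumptions made for illustration and are not taken from FlagBLAS, whose actual API is not described here.

```python
def gemm(alpha, a, b, beta, c):
    """Reference GEMM semantics: return alpha * (a @ b) + beta * c.

    Hypothetical helper for illustration only; matrices are row-major
    nested lists. This is not the FlagBLAS API.
    """
    m, k, n = len(a), len(b), len(b[0])
    # Validate shapes: a is m x k, b is k x n, c is m x n.
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions of a and b do not match")
    if len(c) != m or any(len(row) != n for row in c):
        raise ValueError("c must have shape m x n")
    return [
        [
            alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
            for j in range(n)
        ]
        for i in range(m)
    ]


result = gemm(2.0, [[1, 2], [3, 4]], [[5, 6], [7, 8]], 1.0, [[1, 1], [1, 1]])
print(result)
```

A library that follows the BLAS standard exposes this operation, along with the other level-1, level-2, and level-3 routines, under the conventional BLAS names and argument order.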

FlagFFT

A JIT-compiled GPU FFT library built on Triton/TLE.

FlagSparse

A domain-specific operator library for sparse computation scenarios.

FlagTensor

A high-performance tensor-primitive library implemented in Triton.

FlagAudio

A multi-backend computing library for audio signal processing.

FlagOS Ecosystem Enablement Projects

vllm-plugin-FL

A plugin for the vLLM inference and serving framework, built on FlagOS's unified multi-chip backend, which includes the unified operator library FlagGems and the unified communication library FlagCX.

Megatron-LM-FL

A fork of Megatron-LM that introduces a plugin-based architecture for supporting diverse AI chips, built on top of FlagOS, a unified open-source AI system software stack.

TransformerEngine-FL

A fork of TransformerEngine that introduces a plugin-based architecture for supporting diverse AI chips, built on top of FlagOS, a unified open-source AI system software stack.

verl-FL

A fork of verl (Volcano Engine Reinforcement Learning for LLMs) that adds multi-chip, multi-hardware support to the upstream library through the FlagOS ecosystem.

PyTorch-Plugin-FL

A custom PyTorch device plugin based on the PrivateUse1 extension mechanism. It registers FlagGems' high-performance Triton operators as the flagos device backend, providing unified multi-chip support.

sglang-plugin-FL

An out-of-tree (OOT) plugin for SGLang, built on FlagOS's unified multi-chip backend, which includes the unified operator library FlagGems and the unified communication library FlagCX. It extends SGLang's inference capabilities across diverse hardware platforms.

FlagOS Domain-Specific Projects

FlagOS-Robo

An integrated training and inference framework for embodied intelligence, that is, AI models that run on robots.

FlagQuantum

A high-performance distributed quantum statevector simulator built on PyTorch, enabling quantum circuit simulation across multiple GPUs with automatic sharding and resharding.
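
As a rough illustration of what statevector simulation involves, the sketch below applies a single-qubit gate to an n-qubit statevector in plain Python. The function name `apply_single_qubit_gate` and the convention that qubit t corresponds to bit t of the basis index are assumptions for this example, not FlagQuantum's API. It runs on one process and does not show sharding; a distributed simulator would split the amplitude array across devices.

```python
import math


def apply_single_qubit_gate(state, gate, target):
    """Apply a 2x2 gate to qubit `target` of a statevector.

    Hypothetical helper for illustration only (not the FlagQuantum API).
    `state` holds 2**n amplitudes; qubit `target` is bit `target` of the
    basis-state index.
    """
    new_state = list(state)
    stride = 1 << target
    for i in range(len(state)):
        # Visit each amplitude pair once, starting from the index whose
        # target bit is 0; its partner has the target bit set to 1.
        if (i & stride) == 0:
            j = i | stride
            a0, a1 = state[i], state[j]
            new_state[i] = gate[0][0] * a0 + gate[0][1] * a1
            new_state[j] = gate[1][0] * a0 + gate[1][1] * a1
    return new_state


s = 1 / math.sqrt(2)
hadamard = [[s, s], [s, -s]]

# Start in |00> and put qubit 0 into an equal superposition.
state = apply_single_qubit_gate([1.0, 0.0, 0.0, 0.0], hadamard, 0)
print(state)
```

Because each gate touches every amplitude and the state holds 2**n of them, memory grows exponentially with qubit count, which is why spreading the statevector over multiple GPUs matters.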

FlagOS Developer Tools