Lightning-AI/litgptPlease refer to the latest official releases for information GitHub Homepage

High-performance large language model toolbox supporting pre-training, fine-tuning, and deployment of 20+ models.

Apache-2.0Python 12.3kLightning-AI Last Updated: 2025-06-18

LitGPT - High-Performance Large Language Model Toolkit

Project Overview

LitGPT is an open-source large language model toolkit developed by Lightning AI, providing pre-training, fine-tuning, and scaling deployment solutions for 20+ high-performance large language models. Built on Lightning Fabric, this project extends Lit-LLaMA and nanoGPT, focusing on delivering enterprise-grade LLM training and deployment capabilities.

Key Features

✅ Enterprise-Grade Features

Apache 2.0 License - Supports unrestricted enterprise-level usage
Developer-Friendly - Single-file implementation without abstraction layers for easy debugging
Performance Optimized - Designed for maximum performance, reduced costs, and accelerated training
Validated Recipes - Highly optimized training/fine-tuning recipes tested at enterprise scale

✅ Advanced Technical Features

From-Scratch Implementation - All models are written from scratch, without abstraction layers, for complete control
Flash Attention v2 - Latest attention mechanism optimization
Multi-GPU Support - Achieved through Fully Sharded Data Parallelism (FSDP)
Memory Optimization - Optional CPU offloading and TPU/XLA support
Quantization Techniques - Supports 4-bit floating point, 8-bit integer, and double quantization
Parameter-Efficient Fine-Tuning - Supports LoRA, QLoRA, Adapter, and Adapter v2

Supported Models

LitGPT supports 20+ mainstream large language models, including:

Major Model Families

Llama Series - Llama 3, 3.1, 3.2, 3.3 (1B-405B parameters)
CodeGemma - 7B model specifically for code generation
Gemma Series - Google's open-source models
Mistral Series - Including Mistral 7B and Mixtral, etc.
Phi Series - Microsoft's small and efficient models
Qwen Series - Alibaba's multilingual models
DeepSeek R1 - Latest inference model

Featured Models

Falcon Series - High-performance models developed by TII
StableLM - Stability AI's stable language model
TinyLlama - Lightweight Llama variant
SmolLM - Hugging Face's small model

Core Functionalities

1. Quick Start

from litgpt import LLM
llm = LLM.load("microsoft/phi-2")
text = llm.generate("Fix the spelling: Every fall, the family goes to the mountains.")
print(text)

2. Model Fine-Tuning

Supports multiple fine-tuning methods:

Full Fine-Tuning - Training all parameters
LoRA Fine-Tuning - Low-Rank Adaptation fine-tuning
QLoRA - Quantized LoRA fine-tuning
Adapter Fine-Tuning - Adapter layer fine-tuning

Example command:

litgpt finetune microsoft/phi-2 \
  --data JSON \
  --data.json_path my_custom_dataset.json \
  --data.val_split_fraction 0.1 \
  --out_dir out/custom-model

3. Model Pre-Training

Supports training from scratch and continued pre-training:

litgpt pretrain EleutherAI/pythia-160m \
  --tokenizer_dir EleutherAI/pythia-160m \
  --data TextFiles \
  --data.train_data_path "custom_texts/" \
  --train.max_tokens 10_000_000 \
  --out_dir out/custom-model

4. Model Deployment

Supports one-click deployment as a web service:

# Deploy pre-trained model
litgpt serve microsoft/phi-2

# Deploy custom model
litgpt serve out/custom-model/final

5. Model Evaluation

Supports multiple evaluation benchmarks:

litgpt evaluate microsoft/phi-2 --tasks 'truthfulqa_mc2,mmlu'

6. Interactive Chat

litgpt chat microsoft/phi-2

Technical Advantages

Performance Optimization

Optimized Inference Speed - Specifically optimized for fast inference
Quantization Support - Reduces memory footprint
Low-Memory GPU Operation - Supports resource-constrained environments
Production-Grade Scaling - Supports 1-1000+ GPUs/TPUs

Memory and Computation Optimization

Mixed Precision Training - Supports FP16, BF16, FP32 mixing
Gradient Checkpointing - Reduces memory usage
CPU Offloading - Handles ultra-large models
Distributed Training - Multi-node multi-GPU support

Configurable Training

LitGPT provides validated YAML configuration files covering different training scenarios:

litgpt finetune \
  --config https://raw.githubusercontent.com/Lightning-AI/litgpt/main/config_hub/finetune/llama-2-7b/lora.yaml

Application Scenarios

Research and Development

Model Research - Provides readable and easily modifiable code
Algorithm Experimentation - Supports rapid implementation of the latest research ideas
Benchmarking - Standardized model evaluation process

Enterprise Applications

Customized Models - Model fine-tuning for specific business scenarios
Production Deployment - Enterprise-grade model service deployment
Cost Optimization - Reduces computational costs through quantization and optimization

Education and Learning

Beginner-Friendly - Clear code structure and detailed documentation
Practical Teaching - Complete process from training to deployment
Research Training - Provides reliable basic tools for researchers

Community Ecosystem

Well-Known Project Support

SAMBA Project - Microsoft's hybrid state-space model developed based on LitGPT
TinyLlama - 300M parameter small model trained using LitGPT
NeurIPS 2023 Challenge - Officially designated LLM efficiency challenge toolkit

Active Open-Source Community

Continuous Updates - Regularly adding new models and features
Community Contributions - Welcoming developers of all levels to participate
Detailed Documentation - Comprehensive tutorials and API documentation

Installation and Usage

Basic Installation

pip install 'litgpt[all]'

Install from Source

git clone https://github.com/Lightning-AI/litgpt
cd litgpt
pip install -e '.[all]'

Basic Usage Flow

Select Model - Choose from 20+ supported models
Prepare Data - Use built-in datasets or custom data
Configure Training - Use preset configurations or custom parameters
Execute Training - Pre-train or fine-tune the model
Model Deployment - Deploy as a production service
Model Evaluation - Use standard benchmarks

Technical Documentation

LitGPT provides complete technical documentation, including:

Quick Start Guide - Complete tutorial from 0 to LitGPT
Fine-Tuning Tutorial - Detailed instructions including LoRA, QLoRA, and Adapter
Pre-Training Guide - Complete process of training a model from scratch
Deployment Documentation - Best practices for production environment deployment
Performance Optimization - OOM error handling and memory optimization techniques
Cloud Deployment - TPU and cloud platform usage guide

Summary

LitGPT is a comprehensive and high-performance large language model toolkit suitable for various application scenarios from research to production. With its design philosophy of from-scratch implementation and no abstraction layers, it provides users with maximum flexibility and control, while ensuring efficient operation under various hardware conditions through rich optimization techniques and configuration options. Whether you are an AI researcher, enterprise developer, or learner, you can find a solution that suits your needs in LitGPT.