huggingface-gemma-recipes
是 Hugging Face 官方維護的一個開源項目,旨在為用戶提供與 Google Gemma 系列模型相關的最小化示例代碼和教程。該項目的核心目標是幫助開發者快速上手 Gemma 模型的推理、微調和各種實際應用場景。
該項目支持 Gemma 3 系列模型的多模態能力:
項目提供了統一的模型推理接口,支持快速加載和使用 Gemma 模型:
from transformers import AutoProcessor, AutoModelForImageTextToText
import torch
model_id = "google/gemma-3n-e4b-it" # 或 google/gemma-3n-e2b-it
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(model_id).to(device)
def model_generation(model, messages):
inputs = processor.apply_chat_template(
messages,
add_generation_prompt=True,
tokenize=True,
return_dict=True,
return_tensors="pt",
)
input_len = inputs["input_ids"].shape[-1]
inputs = inputs.to(model.device, dtype=model.dtype)
with torch.inference_mode():
generation = model.generate(**inputs, max_new_tokens=32, disable_compile=False)
generation = generation[:, input_len:]
decoded = processor.batch_decode(generation, skip_special_tokens=True)
print(decoded[0])
# 文本問答
messages = [
{
"role": "user",
"content": [
{"type": "text", "text": "What is the capital of France?"}
]
}
]
model_generation(model, messages)
# 語音轉文本
messages = [
{
"role": "user",
"content": [
{"type": "text", "text": "Transcribe the following speech segment in English:"},
{"type": "audio", "audio": "https://huggingface.co/datasets/ariG23498/demo-data/resolve/main/speech.wav"},
]
}
]
model_generation(model, messages)
# 圖像描述
messages = [
{
"role": "user",
"content": [
{"type": "image", "image": "https://huggingface.co/datasets/ariG23498/demo-data/resolve/main/airplane.jpg"},
{"type": "text", "text": "Describe this image."}
]
}
]
model_generation(model, messages)
項目提供了多種微調方案和腳本:
# 安裝依賴
$ pip install -U -q -r requirements.txt
# 安裝核心依賴
$ pip install -U -q transformers timm
# 安裝完整依賴(用於微調)
$ pip install -U -q -r requirements.txt
huggingface-gemma-recipes/
├── notebooks/ # Jupyter 筆記本教程
│ └── fine_tune_gemma3n_on_t4.ipynb
├── scripts/ # 微調腳本
│ ├── ft_gemma3n_image_vt.py
│ ├── ft_gemma3n_audio_vt.py
│ └── ft_gemma3n_image_trl.py
├── requirements.txt # 依賴列表
└── README.md # 項目說明
該項目作為 Hugging Face 官方維護的開源項目,具有以下優勢:
huggingface-gemma-recipes
是一個高質量的開源項目,為 Gemma 模型的使用提供了完整的解決方案。無論是初學者還是有經驗的開發者,都能從中找到適合的資源和指導。項目的多模態支持和靈活的微調方案使其成為當前 AI 開發領域的重要工具之一。