gemma#
The Gemma architecture transformer variant.
Supports all the Gemma 1, Gemma 2 and Gemma 3 architectures. Based on the Flax reference implementation at google-deepmind/gemma.
See the Gemma technical reports for more information:
Gemma 1: https://arxiv.org/abs/2403.08295
Gemma 2: https://arxiv.org/abs/2408.00118
Gemma 3: https://arxiv.org/abs/2503.19786
Functions
|
Builds a Gemma model from a pretrained checkpoint. |