GemmaKVCachingState
class penzai.example_models.gemma.sampling_mode.GemmaKVCachingState
Bases: Struct
Sampling state for the key-value-caching Gemma variant.
You should not usually need to construct this on your own. Instead, it will be returned by GemmaKVCachingTransformer.from_uncached and updated by GemmaKVCachingTransformer.__call__.
Variables:
cache_len (int) – The length of the key-value caches along the “seq” axis.
batch_axes (dict[str, int]) – Axis names and sizes for the batch axes in the key-value caches.
kv_caches (dict[str, Any]) – A dictionary of key-value caches extracted from the model.
cache_end_index (int | jax.Array) – The current end index of the KV caches, used as the offset at which new keys and values will be inserted.
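To make the role of cache_end_index concrete, here is a minimal sketch of how an insertion offset of this kind typically behaves. It does not use penzai or JAX: ToyKVCachingState and toy_append are hypothetical names invented for illustration, the caches are plain Python lists rather than arrays, and the toy holds a single layer. It only assumes what the field descriptions above state, namely that new keys and values are written starting at cache_end_index, and that the index then moves past them.

```python
import dataclasses
from typing import Any, Dict


@dataclasses.dataclass(frozen=True)
class ToyKVCachingState:
    """Hypothetical stand-in with the same four fields; not the penzai class."""
    cache_len: int              # capacity along the "seq" axis
    batch_axes: Dict[str, int]  # batch axis names and sizes
    kv_caches: Dict[str, Any]   # here: layer name -> (keys, values) lists
    cache_end_index: int        # offset where the next entries are written


def toy_append(state, layer, new_keys, new_values):
    """Writes new entries at cache_end_index and returns an advanced state."""
    if len(new_keys) != len(new_values):
        raise ValueError("keys and values must have the same length")
    start = state.cache_end_index
    end = start + len(new_keys)
    if end > state.cache_len:
        raise ValueError("new entries would overflow the cache")
    keys, values = state.kv_caches[layer]
    # Build new lists instead of mutating in place (functional-style update).
    keys = keys[:start] + list(new_keys) + keys[end:]
    values = values[:start] + list(new_values) + values[end:]
    return dataclasses.replace(
        state,
        kv_caches={**state.kv_caches, layer: (keys, values)},
        cache_end_index=end,
    )


state = ToyKVCachingState(
    cache_len=4,
    batch_axes={"batch": 1},
    kv_caches={"layer_0": ([None] * 4, [None] * 4)},
    cache_end_index=0,
)
# A two-token prompt followed by one sampled token.
state = toy_append(state, "layer_0", ["k0", "k1"], ["v0", "v1"])
state = toy_append(state, "layer_0", ["k2"], ["v2"])
```

After the two calls, three of the four slots are filled and cache_end_index points at the first unused slot. In the real class these updates are performed by GemmaKVCachingTransformer.__call__, as noted above, so this toy is only a mental model of the bookkeeping.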
Methods
__init__(cache_len, batch_axes, kv_caches, ...)
Attributes
cache_len
batch_axes
kv_caches
cache_end_index
Inherited Methods
attributes_dict() – Constructs a dictionary with all of the fields in the class.
from_attributes(**field_values) – Directly instantiates a struct given all of its fields.
key_for_field(field_name) – Generates a JAX PyTree key for a given field name.
select() – Wraps this struct in a selection, enabling functional-style mutations.
tree_flatten() – Flattens this tree node.
tree_flatten_with_keys() – Flattens this tree node with keys.
tree_unflatten(aux_data, children) – Unflattens this tree node.
treescope_color() – Computes a CSS color to display for this object in treescope.