Initial attempt at gemma2 tokenizer #358

city96 · 2025-11-08T18:41:08Z

This PR attempts to add support for reconstructing the text encoder for the gemma 2B model, in a similar fashion to how it's done for UMT5. It is meant as an alternative to PR #346, which instead keeps the custom spiece_model tensor used by comfy via a custom conversion script.

The PR itself still needs work: the tokenizer isn't correct yet and doesn't match 1:1.
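For anyone wanting to check the 1:1 match, here is a rough sketch of how the reconstructed tokenizer could be compared against a reference. It is not code from this PR: `first_mismatch`, `compare_tokenizers`, and the two encoder callables are hypothetical names, and each encoder is assumed to be any function that maps a string to a list of token IDs.

```python
from typing import Callable, Iterable, List, Optional, Tuple

# Hypothetical type: any callable that turns text into token IDs.
Encoder = Callable[[str], List[int]]


def first_mismatch(a: List[int], b: List[int]) -> Optional[int]:
    """Return the index of the first differing token ID, or None if identical."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    # Same common prefix but different lengths: mismatch starts where the shorter one ends.
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def compare_tokenizers(
    reference: Encoder, candidate: Encoder, prompts: Iterable[str]
) -> List[Tuple[str, int]]:
    """Collect (prompt, index of first mismatch) for every prompt that differs."""
    failures = []
    for text in prompts:
        idx = first_mismatch(reference(text), candidate(text))
        if idx is not None:
            failures.append((text, idx))
    return failures


# Toy stand-ins for the real tokenizers (assumption, for illustration only).
def toy_reference(text: str) -> List[int]:
    return [ord(c) for c in text]


def toy_candidate(text: str) -> List[int]:
    return [ord(c) for c in text.lower()]


toy_failures = compare_tokenizers(toy_reference, toy_candidate, ["abc", "aBc"])
```

In practice the toy encoders would be swapped for the original tokenizer and the reconstructed one, run over a set of prompts that covers whitespace, unicode, and special tokens.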

Initial attempt at gemma2 tokenizer

bf8cb71

city96 mentioned this pull request Nov 8, 2025

Gemma2 text model support #346

Closed
