Skip to content

feat: add LLM Gateway gemma-4, kimi-k2.7-code-highspeed, qwen3.5-9b, glm-5.2#2623

Open
smakosh wants to merge 1 commit into
anomalyco:devfrom
smakosh:feat/llmgateway-add-gemma4-kimi-highspeed-qwen35-glm52
Open

feat: add LLM Gateway gemma-4, kimi-k2.7-code-highspeed, qwen3.5-9b, glm-5.2#2623
smakosh wants to merge 1 commit into
anomalyco:devfrom
smakosh:feat/llmgateway-add-gemma4-kimi-highspeed-qwen35-glm52

Conversation

@smakosh

@smakosh smakosh commented Jun 16, 2026

Copy link
Copy Markdown
Contributor

Adds LLM Gateway provider entries for newly available text models.

Models added (providers/llmgateway/models/)

model base_model input output cache_read reasoning
gemma-4-31b-it google/gemma-4-31b-it 0.13 0.38 yes
gemma-4-26b-a4b-it google/gemma-4-26b-a4b-it 0.07 0.34 yes
kimi-k2.7-code-highspeed moonshotai/kimi-k2.7-code-highspeed 1.9 8 0.38 effort + interleaved
qwen3.5-9b alibaba/qwen3.5-9b 0.1 0.15 yes
glm-5.2 zhipuai/glm-5.2 1.4 4.4 0.26 interleaved

Base metadata added (models/)

  • models/moonshotai/kimi-k2.7-code-highspeed.toml — highspeed tier of kimi-k2.7-code (same weights)
  • models/alibaba/qwen3.5-9b.toml

gemma-4-* and glm-5.2 base definitions already existed in models/.

Sources

  • Pricing/capabilities for gemma-4, kimi-k2.7-code-highspeed, qwen3.5-9b from the live api.llmgateway.io/v1/models catalog.
  • glm-5.2 pricing from the Z.AI docs: input $1.4, cached input $0.26, output $4.4 (not yet live on LLM Gateway; added ahead of rollout).

bun validate passes.

🤖 Generated with Claude Code

…glm-5.2

Add provider entries for newly available LLM Gateway text models:
- gemma-4-31b-it, gemma-4-26b-a4b-it (Google, reasoning)
- kimi-k2.7-code-highspeed (Moonshot, highspeed tier of kimi-k2.7-code)
- qwen3.5-9b (Alibaba)
- glm-5.2 (Z.AI)

Adds base model metadata for kimi-k2.7-code-highspeed and qwen3.5-9b.
Pricing for gemma/kimi/qwen taken from the api.llmgateway.io catalog;
glm-5.2 pricing from the Z.AI docs (input $1.4, cache_read $0.26, output $4.4).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant