Back to blog
    May 1, 20261 min read

    Kimi K2.6 is live on Lilac

    By Lucas Ewing


    TL;DR

    Kimi K2.6 is now live on Lilac as an OpenAI-compatible shared endpoint.

    ModelInputCache readOutputContext
    Kimi K2.6$0.70 / M tokens$0.20 / M tokens$3.50 / M tokens262K

    No contracts, no reserved-capacity markup, no minimums. Use the same OpenAI SDK and point it at Lilac.


    Why Kimi K2.6

    Kimi K2.6 is Moonshot AI's latest Kimi model for coding, long-context execution, tool use, and agent workflows. The official Kimi launch page describes it as a model for building real products, coordinating agent work, and handling complex tasks with stronger reliability.

    Read the official model page here: Kimi K2.6 by Moonshot AI.

    Pricing

    Kimi K2.6 on Lilac uses three token rates:

    • Input — $0.70 / M fresh input tokens
    • Cache read — $0.20 / M repeated input tokens served from cache
    • Output — $3.50 / M generated tokens

    Cache-read pricing matters for coding agents, tool-heavy workflows, and long-context chat sessions where most of the prompt context repeats across turns.

    OpenAI-compatible API

    from openai import OpenAI
    
    client = OpenAI(
        base_url="https://api.getlilac.com/v1",
        api_key="lilac_sk_...",
    )
    
    response = client.chat.completions.create(
        model="moonshotai/kimi-k2.6",
        messages=[
            {"role": "user", "content": "Build a TypeScript helper for this API."},
        ],
    )
    

    Availability

    Kimi K2.6 is available now on Lilac's shared warm endpoints. Start from the inference section, check the pricing docs, or sign up in the Lilac console.

    Keep reading

    Back to all posts