The compute already exists. It's just idle.

    Lilac is a network of idle GPUs. Run inference, reserve clusters, fine-tune models, and process batch jobs on unused capacity, or join the network to automatically monetize downtime in your cluster.

    Trusted by

    Products

    One GPU Network.
    Four ways to use Compute.

    Inference

    Frontier models, idle-GPU prices

    Pay per token for open frontier models on warm idle GPUs. Point your OpenAI SDK at Lilac and go, with no contracts or minimums.

    6+ modelsGet started

    MiniMax M2.7

    Speed
    -tok/s
    TTFT
    -
    Input /M
    $0.30
    Cache /M
    $0.055
    Output /M
    $1.20
    200KFP8

    MiniMax M3

    Speed
    -tok/s
    TTFT
    -
    Input /M
    $0.28
    Cache /M
    $0.05
    Output /M
    $1.10
    1MFP8

    Kimi K2.6

    Speed
    -tok/s
    TTFT
    -
    Input /M
    $0.70
    Cache /M
    $0.20
    Output /M
    $3.50
    262KINT4

    GLM 5.1

    Speed
    -tok/s
    TTFT
    -
    Input /M
    $0.90
    Cache /M
    $0.27
    Output /M
    $3.00
    203KFP8/NVFP4

    GLM 5.2

    Speed
    -tok/s
    TTFT
    -
    Input /M
    $0.90
    Cache /M
    $0.27
    Output /M
    $3.00
    524KFP8/NVFP4

    Gemma 4 (31B)

    Speed
    -tok/s
    TTFT
    -
    Input /M
    $0.11
    Cache /M
    --
    Output /M
    $0.35
    262KFP8

    Subscriptions

    Credits that stretch on idle supply

    A flat monthly price for a pool of inference credits. When GPUs sit idle, idle-supply discounts let those credits buy up to 12x more.

    Up to 12x valueGet started

    Batch

    Run containers to completion

    Submit a container image and a command. Lilac runs it on spare GPU capacity with pricing at $1/hr H100s and $1.50/hr H200s.

    Priced per secondContact us

    Clusters

    Cluster quotes, brokered for free

    Tell us the GPU type and scale. We source competitive quotes from our neo-cloud partners and help you offset the cost with idle-GPU software.

    H100 to B300Contact us

    Reserve H100s

    Pricing is estimated. We gather quotes in real time.

    *Estimate pricing

    Idle GPU Network

    The compute is already on.
    Put it to work.