Skip to content
  • Models
  • Providers
  • Rankings
  • Docs
  • Status
  • Announcements
  • About
  • Partners
  • Enterprise
  • Careers
  • Pricing
  • Support
  • Privacy
  • Terms
  • © 2026 OpenRouter, Inc

    Cogito V2 Preview Llama 109B

    deepcogito/cogito-v2-preview-llama-109b-moe

    Created Sep 2, 202532,767 context
    $0.18/M input tokens$0.59/M output tokens

    An instruction-tuned, hybrid-reasoning Mixture-of-Experts model built on Llama-4-Scout-17B-16E. Cogito v2 can answer directly or engage an extended “thinking” phase, with alignment guided by Iterated Distillation & Amplification (IDA). It targets coding, STEM, instruction following, and general helpfulness, with stronger multilingual, tool-calling, and reasoning performance than size-equivalent baselines. The model supports long-context use (up to 10M tokens) and standard Transformers workflows. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

    Recent activity on Cogito V2 Preview Llama 109B

    Total usage per day on OpenRouter

    Prompt
    4.57M
    Completion
    87K
    Reasoning
    451

    Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.