Veni AI

AI21 Labs Jamba 1.5 Large

AI21 Jamba 1.5 Large is a long-context, hybrid MoE chat model from AI21 Labs with a 256K token window and tool-use features.

Try Now
SCROLL
01

What is AI21 Jamba 1.5 Large?

AI21 Jamba 1.5 Large is a long-context AI model from AI21 Labs built on a hybrid Mamba-Transformer Mixture-of-Experts architecture designed for efficient processing across long context lengths. Jamba 1.5 Large supports a 256K token context window and developer-ready features like structured JSON output, function calling, document digestion, and grounded generation with citations. It is available through major cloud AI catalogs, including Azure AI Models, Amazon Bedrock, and Google Vertex AI.

02

Technical Specifications

Context Window

256K tokens

Max Output

Not disclosed

Training Cutoff

Not disclosed

Active

Active

03

Capabilities

Chat completion
Long-context processing (256K context window)
Structured JSON output
Function calling / tool use
Document digestion
Grounded generation with citations
05

Pros & Cons

Pros

  • 256K context window for long-document workflows
  • Hybrid MoE architecture optimized for long context
  • Structured JSON output and function calling
  • Broad availability across major cloud AI catalogs

Cons

  • Text-only input/output
  • Open model license terms apply
06

Features

01

256K Context Window

Handle long documents and multi-step workflows with a 256K token context window.

02

Hybrid MoE Architecture

Combines Mamba (SSM) and Transformer blocks for long-context efficiency and quality.

03

Developer-Ready Tooling

Supports structured JSON output and function calling for reliable tool use.

04

Grounded Outputs

Supports grounded generation with citations for verifiable outputs.

07

Use Cases

01

Long-Document Analysis

Summarize and synthesize large documents, reports, and research packs.

02

Structured Data Extraction

Extract entities and facts into strict JSON schemas for downstream systems.

03

Tool-Using Assistants

Build agent workflows with function calling and reliable tool execution.

04

RAG with Citations

Ground responses in documents and return citations for compliance workflows.

05

Multilingual Support

Serve global users across a broad set of supported languages.

09

FAQ