LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per token, it delivers high-quality generation while maintaining low inference costs. The model fits within 32 GB of RAM, making it practical to run on consumer laptops and desktops without sacrificing capability.
33KContext Window
Category
Economy
API access
Unified context
RAG + Knowledge Base
24/7 Support
🚀 Capabilities
Streaming output
Specifications
| Provider | liquid |
| Context Window | 32,768 tokens |
| Minimum Plan | Economy |
Pricing
| Input Price | $0.0300 / 1M tokens |
| Output Price | $0.1200 / 1M tokens |
💡 With PRO subscription, cost is reduced by 20%