Key features
- Apple Silicon optimization for M-series and A-series chips
- High-performance inference engine faster than MLX and llama.cpp
- Swift SDK for seamless iOS and macOS application integration
- One-line model conversion and high-quality quantization pipeline
- Zero-latency local processing for offline functionality
- On-device execution ensuring complete data privacy
