flutter_llama 1.1.2
flutter_llama: ^1.1.2 copied to clipboard
Flutter plugin for running LLM inference with llama.cpp and GGUF models on Android and iOS
Changelog #
All notable changes to this project will be documented in this file.
1.1.2 - 2025-01-28 #
Changed #
- Optimized package size by excluding debug symbols and build artifacts
- Added comprehensive .pubignore to reduce package size from 234 MB to 20 MB
1.1.1 - 2025-01-28 #
1.0.1 - 2025-01-27 #
1.0.0 - 2025-01-27 #
Added #
- Initial stable release
- Full support for llama.cpp GGUF models on iOS and macOS
- GPU acceleration via Metal
- CPU optimization via Accelerate framework
- Streaming text generation
- Batch processing support
- Multiple model format support
- Comprehensive documentation
- Example application with demos
- Integration tests