--- license: mit base_model: - Qwen/Qwen3-4B-Instruct-2507 --- ## LiteCoder-4b-Terminal-preview **LiteCoder-4b-Terminal-preview** is part of our series of models specialized in terminal-based interactions and stems from our recent efforts to develop capable small and medium-sized code agent models. The model is fine-tuned from ` Qwen3-4B-Instruct-2507` on the [LiteCoder-SFT-Terminal-preview](https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview) dataset. **Notably, this model achieves competitive results using fewer than 1,000 training samples.** By relying entirely on a fully synthetic pipeline—without converting any existing datasets—we were able to secure significant gains on the challenging Terminal Bench, matching the performance of leading open-source models with extreme data efficiency. ## Released Artifacts | 2025/12/17 | | | | --- | --- | --- | | LiteCoder-4b-Terminal-preview | Model | https://huggingface.co/Lite-Coder/LiteCoder-4b-Terminal-preview | | LiteCoder-SFT-Terminal-preview | Dataset | https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview | ## Results Our models achieve competitive results on **Terminal Bench**, significantly outperforming general-purpose models of similar (and even larger) sizes. **Terminal Bench 1.0 Performance** | **Model** | **Agent** | **Results** | | --- | --- | --- | | **LiteCoder-30a3b-Terminal-preview** | Terminus 2 | **18.75%** | | Qwen3-30B-A3B-Nex-N1 | Terminus 2 | 18.75% | | **LiteCoder-4b-Terminal-preview** | Terminus 2 | **13.75%** | | Qwen3-30B-A3B-Instruct | Terminus 2 | 12.5% | | Qwen3-4B-Instruct | Terminus 2 | 5.0% | **Terminal Bench 2.0 Performance** | **Model** | **Agent** | **Results** | | --- | --- | --- | | **LiteCoder-30a3b-Terminal-preview** | Terminus 2 | **5.6%** | | **LiteCoder-4b-Terminal-preview** | Terminus 2 | **3.3%** | | Qwen3-32B | Terminus 2 | 1.9% | | InternLM3-8B-Nex-N1 | Terminus 2 | 0% | | Qwen3-8B | Terminus 2 | 0% | ## Citation ```latex @misc{LiteCoder Team, title={LiteCoder: Advancing Small and Medium-sized Code Agents}, author={Xiaoxuan Peng and Xinyu Lu and Kaiqi Zhang and Taosong Fang and Boxi Cao and Yaojie Lu}, year={2025}, } ``` ## Future Directions - **Scaling Environments:** Expanding the diversity of Docker environments and teacher models to improve generalization. - **Agentic RL:** Implementing Reinforcement Learning specifically for multi-turn agentic workflows. ## Team & Contributions - **Xiaoxuan Peng:** Main Contributor - [Xinyu Lu](https://scholar.google.com/citations?user=_OsLG8EAAAAJ&hl=zh-CN)**:** Project Lead - **Kaiqi Zhang:** Contributor - **Taosong Fang**: Contributor - **Boxi Cao:** Contributor - **Yaojie Lu:** Contributor ## Acknowledgements LiteCoder builds upon multiple open-source projects, including [Harbor](https://github.com/laude-institute/harbor). The models are trained using [AutoAlign](https://github.com/icip-cas/AutoAlign). ## Join Us Join the discussion on our [Discord](https://discord.gg/EX9qZe8B).