Rombos-LLM-V2.5.1-Qwen-3b is a little experiment that merges a high-quality LLM, arcee-ai/raspberry-3B, using the last step of the Continuous Finetuning method outlined in a Google document. The merge is done using the mergekit with the following parameters:
- Models: Qwen2.5-3B-Instruct, raspberry-3B
- Merge method: ties
- Base model: Qwen2.5-3B
- Parameters: weight=1, density=1, normalize=true, int8_mask=true
- Dtype: bfloat16
The model has been evaluated on various tasks and datasets, and the results are available on the Open LLM Leaderboard. The model has shown promising performance across different benchmarks.
Links