PHANTOM
🇮🇳 IN
Skip to content

Pull requests: hiyouga/LlamaFactory

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[v1] add qwen3 templates and fix rendering plugin.
#10212 opened Feb 25, 2026 by xvxuopop Loading…
1 of 2 tasks
[V1] add seed for training and fix gradient checkpointing
#10211 opened Feb 24, 2026 by jiaqiw09 Loading…
1 of 2 tasks
[WIP][V1] fix meta init training for full/freeze/lora
#10210 opened Feb 24, 2026 by jiaqiw09 Loading…
1 of 2 tasks
feat: add LightOnOCR-2 integration for LoRA/QLoRA fine-tuning
#10192 opened Feb 16, 2026 by johnlockejrr Loading…
2 tasks
Fix memory leak on MPS by explicitly clearing cache in trainer step
#10190 opened Feb 14, 2026 by asebaq Loading…
1 of 2 tasks
[v1] Add hyperparams and training docs
#10188 opened Feb 13, 2026 by frozenleaves Loading…
[deps] Add libibverbs for RDMA support
#10185 opened Feb 12, 2026 by RossCZ Loading…
1 of 2 tasks
Feature: experimental fine-tuning comparison
#10172 opened Feb 6, 2026 by caterina0718 Loading…
Add Trackio Integration for LlamaFactory
#10165 opened Feb 4, 2026 by ParagEkbote Loading…
1 of 2 tasks
[feat] Add DeepSpeed ZeRO-3 LoRA checkpoint save support
#10124 opened Jan 22, 2026 by kimberlykang Loading…
2 tasks done
[model] support NVIDIA's Audio-Flamingo-3 audio model
#9740 opened Jan 9, 2026 by vovanphuc Loading…
4 tasks done
Add entropy logging for SFT training path
#9717 opened Jan 5, 2026 by pankd Loading…
Support loss_mask in dataset to control loss calculation for specific turns solved This problem has been already solved
#9630 opened Dec 18, 2025 by CjangCjengh Loading…
2 tasks
Add hf_infer script for inference using HuggingFace backend pending This problem is yet to be addressed
#9370 opened Oct 29, 2025 by WinterShiver Loading…
1 of 2 tasks
support pre-tokenized parquet datasets pending This problem is yet to be addressed
#9351 opened Oct 25, 2025 by AbdulmalikDS Loading…
2 of 3 tasks
Implement LoRA for MoE with support for LoRA injection for nn.parameters pending This problem is yet to be addressed
#9337 opened Oct 23, 2025 by Ziheng-Zhang-AUS Loading…
2 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.