-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
Pull requests: unslothai/unsloth
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Disable flex-attention defaults for Gemma3N variants
#4116
opened Feb 25, 2026 by
danielhanchen
Loading…
Fix Whisper concrete auto_model fallback in FastBaseModel.from_pretrained
#4115
opened Feb 25, 2026 by
danielhanchen
Loading…
Fix full-finetuning fp32 precision fallback for issue #4082
#4114
opened Feb 25, 2026 by
danielhanchen
Loading…
fix(Triton): ensure float32 eps in RMS LayerNorm rsqrt for HIP/ROCm
#4110
opened Feb 25, 2026 by
GoldenGrapeGentleman
Loading…
fix(ROCm): Comprehensive RDNA GPU support - fix Gemma3 NaN & add is_rdna()
#4109
opened Feb 25, 2026 by
GoldenGrapeGentleman
Loading…
Fix DDP "marked ready twice" for VLMs with CPU offload + TiledMLP
#4077
opened Feb 17, 2026 by
nepfaff
Loading…
Fix tool calling compatibility for Llama 3.2 and Phi-4
#4038
opened Feb 12, 2026 by
VedantMadane
Loading…
fix: use env-only SM100 workaround for vLLM PDL/MMA path
#4035
opened Feb 11, 2026 by
danielhanchen
Loading…
Guard TRL estimate_tokens warnings_issued writes in patched RL trainers
#4034
opened Feb 11, 2026 by
danielhanchen
Loading…
Fix Gemma3 NaN losses on ROCm by disabling torch.compile for RDNA GPUs
#4029
opened Feb 11, 2026 by
GoldenGrapeGentleman
Loading…
Fix global dequantize buffer dtype mismatch across mixed-precision loads
#4026
opened Feb 11, 2026 by
GoldenGrapeGentleman
Loading…
Make bitsandbytes optional on ROCm and add bf16 helper
#4000
opened Feb 8, 2026 by
danielhanchen
Loading…
Extend TRL experimental patching and vLLM readiness
#3984
opened Feb 5, 2026 by
danielhanchen
Loading…
Fix TRL 0.25.1+ GRPO vision crash and reward function TypeError
#3975
opened Feb 3, 2026 by
danielhanchen
Loading…
5 tasks done
Add vLLM fallback and GRPO completion normalization
#3958
opened Feb 2, 2026 by
danielhanchen
Loading…
Add vLLM‑style Runtime Metrics (Inference + Training) with Opt‑In Telemetry
#3897
opened Jan 16, 2026 by
hnxnq7
Loading…
feat: Native CGGR support for SFTTrainer (closes #3884)
#3891
opened Jan 14, 2026 by
Wilbatronic
Loading…
Add context parallelism support (SDPA only)
#3823
opened Jan 2, 2026 by
djsaunde
Loading…
1 of 2 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.