Pull requests: huggingface/transformers
Fix logging with each layer with ms-swift lora fine-tuning
#44378
opened Mar 1, 2026 by
redpanda1995
5 tasks
Implement Tensor Parallelism (TP) support for compressed tensors quantization
#44377
opened Mar 1, 2026 by
redpanda1995
5 tasks
Fix incorrect position_ids docstring in modeling_flash_attention_utils.py
#44374
opened Mar 1, 2026 by
ManasVardhan
fix: handle Cython-compiled functions in get_docstring_indentation_level
#44372
opened Mar 1, 2026 by
N3u0ns
Fix resume_from_checkpoint key mismatch for models with _checkpoint_conversion_mapping
#44366
opened Mar 1, 2026 by
jayakumarpujar
3 tasks done
Add missing ReLU in GlmMoeDsaIndexer scoring
#44364
opened Mar 1, 2026 by
jayakumarpujar
4 tasks done
Fix off-by-one in decode_spans causing ValueError with np.argpartition
#44363
opened Mar 1, 2026 by
jayakumarpujar
3 tasks done
fix(tokenizer): Fix MLukeTokenizer AttributeError post-v5 refactor
#44362
opened Feb 28, 2026 by
harshaljanjani
3 of 5 tasks
fix(sdpa): add workaround for MPS sdpa_vector_2pass_mps correctness bug
#44359
opened Feb 28, 2026 by
sxu75374
3 tasks done
Fix ShieldGemma2 non-reproducible outputs by adding _tied_weights_keys
#44358
opened Feb 28, 2026 by
hardikmeisheri
2 tasks done
Fix RoPE inv_freq default initialization (Issue #39753)
#44357
opened Feb 28, 2026 by
NabilMch
4 of 5 tasks
fix: avoid redundant fp16/bf16 model casts in evaluation_loop
#44356
opened Feb 28, 2026 by
iamaber
fix: off-by-one in decode_spans causes ValueError when topk == len(scores)
#44354
opened Feb 28, 2026 by
sxu75374
fix: suppress ANSI escape codes when stdout is not a terminal
#44352
opened Feb 28, 2026 by
giulio-leone
fix: support linear_attention in continuous batching and fix serve ch…
#44349
opened Feb 28, 2026 by
zzc0430
3 tasks done
Enable MetalConfig to load pre-quantized MLX models from HuggingFace Hub
#44348
opened Feb 28, 2026 by
n0kovo
3 of 4 tasks
fix: resolve composite config in PagedAttentionCache and group_layers_by_attn_type
#44347
opened Feb 27, 2026 by
sxu75374
fix(deepseek): pass config.rms_norm_eps to MLA q/kv layernorms
#44346
opened Feb 27, 2026 by
sxu75374
2 of 4 tasks