Issues: huggingface/transformers
Issues list
how to generate router_logits in moe models using model.generate()?
#31722
opened Jul 1, 2024 by
Jimmy-Lu
1 of 4 tasks
how to remove kv cache?
Feature request
Request for a new feature
#31717
opened Jun 30, 2024 by
tsw123678
Error when using AutoTokenizer to load local files without network
#31712
opened Jun 29, 2024 by
pppppkun
2 of 4 tasks
Add bot_token attribute to PreTrainedTokenizer and PreTrainedTokenizerFast
Feature request
Request for a new feature
#31709
opened Jun 29, 2024 by
aw632
When I used galore, the learning rate was set to 8e-6, but the training rate was 0.001
#31707
opened Jun 29, 2024 by
Minami-su
meta-llama/Llama-2-7b-chat-hf tokenizer model_max_length attribute needs to be fixed.
#31705
opened Jun 28, 2024 by
rohitdwivedula
4 tasks
Unable to load models with adapter weights in offline mode
#31700
opened Jun 28, 2024 by
amyeroberts
1 of 4 tasks
Any config for DeBERTa series as decoders for TSDAE?
Feature request
Request for a new feature
#31688
opened Jun 28, 2024 by
bobox2997
NameError: free variable 'state_dict' referenced before assignment in enclosing scope
#31685
opened Jun 28, 2024 by
AllentDan
1 of 4 tasks
Whisper - list index out of range with word level timestamps
Audio
#31683
opened Jun 28, 2024 by
maxkvbn
2 of 4 tasks
Mismatch with epoch when using gradient_accumulation
#31677
opened Jun 28, 2024 by
SangbumChoi
2 of 4 tasks
Error running inference on CogVLM2 when distributing it on multiple GPUs: Expected all tensors to be on the same device, but found at least two devices
#31676
opened Jun 28, 2024 by
ghazalsaheb
2 of 4 tasks
QLORA + FSDP distributed fine-tuning failed at the end during model saving stage
PEFT
PyTorch FSDP
#31675
opened Jun 27, 2024 by
Neo9061
2 of 4 tasks
Do we need a config to change padding_side='left before the evaluation?
Feature request
Request for a new feature
#31672
opened Jun 27, 2024 by
gary-young
transformers.pipeline does not load tokenizer passed as string for custom models
#31669
opened Jun 27, 2024 by
chedatomasz
3 of 4 tasks
[Bug] Modifying normalizer for pretrained tokenizers don't consistently work
Core: Tokenization
Internals of the library; Tokenization.
#31653
opened Jun 27, 2024 by
alvations
adalomo is not a valid OptimizerNames
optimization
#31651
opened Jun 27, 2024 by
luoruijie
2 of 4 tasks
"use_safetensors" not enforced with "local_files_only", loads bin file
Core: Modeling
Internals of the library; Models.
#31649
opened Jun 26, 2024 by
troy-baker-aumni
4 tasks