huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 25.5k
Star 128k

Code
Issues 897
Pull requests 258
Actions
Projects 26
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: huggingface/transformers

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

897 Open 14,206 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

how to generate router_logits in moe models using model.generate()?

#31722 opened Jul 1, 2024 by Jimmy-Lu

1 of 4 tasks

Model loading OOM when using FSDP + QLoRA

#31721 opened Jul 1, 2024 by Neo9061

2 of 4 tasks

Phi3SmallForCausalLM missing?

#31719 opened Jun 30, 2024 by 0wwafa

how to remove kv cache? Feature request

Request for a new feature

#31717 opened Jun 30, 2024 by tsw123678

LLava-Next example is broken

#31713 opened Jun 29, 2024 by isidentical

1 of 4 tasks

Error when using AutoTokenizer to load local files without network

#31712 opened Jun 29, 2024 by pppppkun

2 of 4 tasks

Add bot_token attribute to PreTrainedTokenizer and PreTrainedTokenizerFast Feature request

Request for a new feature

#31709 opened Jun 29, 2024 by aw632

When I used galore, the learning rate was set to 8e-6, but the training rate was 0.001

#31707 opened Jun 29, 2024 by Minami-su

meta-llama/Llama-2-7b-chat-hf tokenizer model_max_length attribute needs to be fixed.

#31705 opened Jun 28, 2024 by rohitdwivedula

4 tasks

Unable to load models with adapter weights in offline mode

#31700 opened Jun 28, 2024 by amyeroberts

1 of 4 tasks

Any config for DeBERTa series as decoders for TSDAE? Feature request

Request for a new feature

#31688 opened Jun 28, 2024 by bobox2997

NameError: free variable 'state_dict' referenced before assignment in enclosing scope

#31685 opened Jun 28, 2024 by AllentDan

1 of 4 tasks

Whisper - list index out of range with word level timestamps Audio

#31683 opened Jun 28, 2024 by maxkvbn

2 of 4 tasks

AttributeError: 'str' object has no attribute 'shape'

#31678 opened Jun 28, 2024 by MARD1NO

4 tasks

Mismatch with epoch when using gradient_accumulation

#31677 opened Jun 28, 2024 by SangbumChoi

2 of 4 tasks

Error running inference on CogVLM2 when distributing it on multiple GPUs: Expected all tensors to be on the same device, but found at least two devices

#31676 opened Jun 28, 2024 by ghazalsaheb

2 of 4 tasks

QLORA + FSDP distributed fine-tuning failed at the end during model saving stage PEFT PyTorch FSDP

#31675 opened Jun 27, 2024 by Neo9061

2 of 4 tasks

Do we need a config to change padding_side='left before the evaluation? Feature request

Request for a new feature

#31672 opened Jun 27, 2024 by gary-young

transformers.pipeline does not load tokenizer passed as string for custom models

#31669 opened Jun 27, 2024 by chedatomasz

3 of 4 tasks

compute_metric(eval_pred) in trainer is not mini-batch

#31667 opened Jun 27, 2024 by SamYuen101234

Failed to import transformers

#31658 opened Jun 27, 2024 by MaitriSavla2003

1 of 4 tasks

[Bug] Modifying normalizer for pretrained tokenizers don't consistently work Core: Tokenization

Internals of the library; Tokenization.

#31653 opened Jun 27, 2024 by alvations

flash attention support for chatglm3-6b

#31652 opened Jun 27, 2024 by elimsjxr

adalomo is not a valid OptimizerNames optimization

#31651 opened Jun 27, 2024 by luoruijie

2 of 4 tasks

"use_safetensors" not enforced with "local_files_only", loads bin file Core: Modeling

Internals of the library; Models.

#31649 opened Jun 26, 2024 by troy-baker-aumni

4 tasks

Previous 1 2 3 4 5 … 35 36 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly