How to train on a custom dataset? #6

Open
chandanasreemala opened this issue Dec 28, 2022 · 1 comment
Comments

@chandanasreemala

  1. How can I train the model on a custom dataset? I have seen the commit you pushed to train with the VoxCeleb2 dataset, which downloads the data from the internet. But how can I train the model with a completely new dataset that is stored on my local PC?

  2. This may be a dumb question, but is there any cap on the maximum number of different classes (in this case, people) the model can identify? Is it in the range of hundreds or thousands?

@Wadaboa
Owner

Wadaboa commented Jan 8, 2023

Hi,

Thanks for opening the issue. Let me answer in order.

  1. I've already provided some guidelines on how to support custom datasets. You can refer to issues data set #3 and voxceleb dataset #5 for that. If you have more specific questions, I'm happy to help, but please elaborate on what you need.
  2. There is no cap on the maximum number of classes the model can work with. However, once you reach the order of thousands of classes, it is generally better to work with metric-learning-based losses (such as the ArcFace loss I mention in the README) instead of standard classification-based losses (such as cross-entropy).

Alessio.
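
For readers unfamiliar with the distinction drawn in point 2, here is a minimal sketch of how an ArcFace-style additive angular margin changes the logits fed to cross-entropy. It is not taken from this repository: the function names, the `scale` and `margin` defaults, and the toy vectors are all illustrative assumptions, and it uses plain Python rather than the project's framework. It also omits the special handling that full implementations apply when the angle plus the margin exceeds pi.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine_logits(embedding, class_weights):
    """Cosine similarity between one embedding and each class weight vector."""
    e = l2_normalize(embedding)
    return [sum(a * b for a, b in zip(e, l2_normalize(w))) for w in class_weights]


def arcface_logits(embedding, class_weights, target, scale=30.0, margin=0.5):
    """Add an angular margin to the target class only, then rescale.

    Hypothetical helper; `scale` and `margin` are illustrative values.
    Simplification: no fallback for the case theta + margin > pi.
    """
    logits = []
    for idx, cos_theta in enumerate(cosine_logits(embedding, class_weights)):
        cos_theta = max(-1.0, min(1.0, cos_theta))  # guard acos against rounding
        if idx == target:
            cos_theta = math.cos(math.acos(cos_theta) + margin)
        logits.append(scale * cos_theta)
    return logits


def cross_entropy(logits, target):
    """Numerically stable softmax cross-entropy for a single sample."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


if __name__ == "__main__":
    emb = [1.0, 0.2]                                   # toy speaker embedding
    weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]    # one vector per class
    plain = [30.0 * c for c in cosine_logits(emb, weights)]
    arc = arcface_logits(emb, weights, target=0)
    print("plain loss:  ", cross_entropy(plain, 0))
    print("arcface loss:", cross_entropy(arc, 0))
```

The margin lowers only the target-class logit, so the same embedding incurs a higher loss than under plain scaled-cosine cross-entropy. This pushes training toward embeddings that sit closer to their own class direction, which is the property that tends to matter as the number of identities grows.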
