Skip to content

For the book and glory #4

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

mkrakovian
Copy link

@mkrakovian mkrakovian commented Apr 14, 2019

Added a spacy classifier.
The data is evaluated and tested on workers via udf functions' calls, and is only collected on driver for model training.

Multiclass classification (41 unique labels).
Model trained on 40148 records, for 10 iterations in 27min.
Model tested on 160705 records, distributed over 4 cores in 5min.

Results:
160705 total in test, 93318 predicted correctly, 0.581
Prediction by guessing categories as uniformly distributed: 0.024

@nikolaypavlov
Copy link
Owner

Well done, @mkrakovyan. Can you send me your email to me ~~at~~ nikolaypavlov dot com?

@mkrakovian
Copy link
Author

Done.
Thanks for the interesting workshop!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants