-
Notifications
You must be signed in to change notification settings - Fork 184
Open
Description
I am encountering a lot of issues with the danish stemming.
Here is some examples of some wrong stem results:
- "bøger" -> "bøg": Incorrect, as "bøger" means "books," while "bøg" means "beech tree."
- "talen" -> "tal": Incorrect, as "talen" means "the speech," while "tal" means "number."
- "moden" -> "mod": Incorrect, as "moden" means "mature," while "mod" means "courage."
- "modner" -> "modn": Incorrect, as "modner" means "ripens," while "modn" is not a valid word.
- "slangen" -> "slang": Incorrect, as "slangen" means "the snake," while "slang" means "casual language."
- "slanger" -> "slang": Incorrect, as "slanger" means "snakes," while "slang" means "casual language."
- "vinder" -> "vind": Incorrect, as "vinder" means "winner" or "wins," while "vind" means "wind."
- "ruten" -> "rut": Incorrect, as "ruten" means "the route," while "rut" is not a valid word.
- "ruter" -> "rut": Incorrect, as "ruter" means "routes," while "rut" is not a valid word.
- "haven" -> "hav": Incorrect, as "haven" means "the garden," while "hav" means "ocean."
- "haver" -> "hav": Incorrect, as "haver" means "gardens," while "hav" means "ocean."
These are some of the examples I could find, there might be more.
Metadata
Metadata
Assignees
Labels
No labels