Skip to content
View Duguce's full-sized avatar
πŸ‘Š
Focusing
πŸ‘Š
Focusing

Block or report Duguce

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Duguce/README.md

πŸ‘‹ About Me

πŸ‘¨β€πŸŽ“ Hi there! I'm a third-year master's student at Shanghai University. My main interests are in machine learning and natural language processing. Currently, my research focuses on the reliable evaluation of large language models (LLMs).

πŸ“« Contact

Pinned Loading

  1. IAAR-Shanghai/xFinder IAAR-Shanghai/xFinder Public

    [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

    Python 175 7

  2. IAAR-Shanghai/GuessArena IAAR-Shanghai/GuessArena Public

    [ACL 2025] GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning

    Python 5

  3. mazzzystar/TurtleBench mazzzystar/TurtleBench Public

    TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.

    Jupyter Notebook 149 9