58 repositories
saev (Public)
Explorer (Public): [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
WebGuard (Public)
Online-Mind2Web (Public)
GrokkedTransformer (Public)
Mind2Web-2 (Public)
UGround (Public): [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
AutoSDT (Public): AutoSDT is a fully automatic pipeline to collect data-driven scientific coding tasks to train co-scientist models.
GUI-Agents-Paper-List (Public)
RedTeamCUA (Public)
HippoRAG (Public)
TravelPlanner (Public): [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
LLM4Chem (Public)
ChemMCP (Public)
ScienceAgentBench (Public): [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
ChemToolAgent (Public): Official code repo for the paper "ChemToolAgent: The Impact of Tools on Language Agents for Chemistry Problem Solving" (previously "Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving")
AgentSafety (Public)
InsightAgent (Public)
hal-harness (Public)
SkillWeaver (Public)
WebDreamer (Public)
Mind2Web (Public)
reversal-curse-binding (Public)
COSMO (Public): [CIKM'24] Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs
In-Context-Reranking (Public)
LLM-Planner (Public)
KG-R3 (Public)
GroundCocoa (Public)
MagicBrush (Public): [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
SeeAct (Public)