Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 21 days ago • 43
Large Language Model-Brained GUI Agents: A Survey Paper • 2411.18279 • Published 28 days ago • 27
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17 • 40
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17 • 40
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17 • 40 • 2
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29 • 8
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29 • 8
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29 • 8 • 3
DLI-Lab/step-wise-eval-description-with-refined-tao-raw-neg_actions Viewer • Updated Sep 21 • 102 • 33
DLI-Lab/step-wise-eval-additional-description-with-refined-tao Viewer • Updated Sep 19 • 33 • 32