Publications

(2023). Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks.

(2023). NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes.

(2023). LLM as OS (llmao), Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem.

(2023). War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars.

(2023). Tutorial on Large Language Models for Recommendation.

(2023). How to Index Item IDs for Recommendation Foundation Models.

(2023). OpenAGI: When LLM Meets Domain Experts.

(2023). UP5: Unbiased Foundation Model for Fairness-aware Recommendation.

(2021). EntQA: Entity Linking as Question Answering.
