
Andy Lee

  • Master's student at SIST, ShanghaiTech University
  • Research focus: interpretability, LLM safety and evaluation
  • Open-source contributor

Latest Papers

  • The Question is the Answer: Weak-to-Strong Benchmarking

    In Submission

  • Identifying Good and Bad Neurons for Task-Level Controllable LLMs

    In Submission

  • GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents

    LanGame @ NeurIPS 2024

  • Delta-Influence: Unlearning Poisons via Influence Functions

    ATTRIB @ NeurIPS 2024

