xf-zhao/README.md

Hi there 👋

  • 👋 Hi, I'm Xufeng Zhao
  • 📫 I'm now a 3rd-year PhD student at the University of Hamburg (UHH)
  • 💼 Previously worked for 2 years at JD.COM
  • 👀 I'm interested in Robotics, Large Language Models (LLMs), and Reinforcement Learning (RL)
  • 💬 Contact me for any discussion about LLMs + RL + Robotics...
  • 🌱 Check out some of my recent publications & implementations below :)


Pinned

  1. Matcha-agent (Public)

    Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268

    Python · 21 stars · 2 forks

  2. mengdi-li/awesome-RLAIF (Public)

    A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

    93 stars · 4 forks

  3. LoT (Public)

    Official implementation of the LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic"

    Python · 9 stars

  4. Agentic-Skill-Discovery (Public)

    Official implementation of the Zero-Hero paper

    Python · 2 stars