Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compare with GPT-4 Advanced Data Analysis (integrating with Python) #6

Open
aadnk opened this issue Oct 26, 2023 · 1 comment
Open

Comments

@aadnk
Copy link

aadnk commented Oct 26, 2023

This wasn't tested in the original paper, but I've found GPT+4 with Python (Advanced Data Analysis) is often capable of solving these kinds of problems by writing a Python program that finds the solution in the search space:

Here, GPT wrote a Python program that tries all permutations of the expression (A^B)^(C^D), where "^" are one of the four basic arithmetic operators (+,-,*,/) and A, B, C, D are a permutation of the given numbers.

It then found the solution (14 - 8) * (8 / 2) = 24, which is correct. And this is in a relatively small number of tokens (input = 112 tokens, output = 512 + prompt for Advanced Data Analysis >= 624 tokens), whereas AoT would likely require far more (the openai.logs file in this repo, for instance, is 15 306 tokens).

Upvote & Fund

  • We're using Polar.sh so you can upvote and help fund this issue.
  • We receive the funding once the issue is completed & confirmed by you.
  • Thank you in advance for helping prioritize & fund our backlog.
Fund with Polar
@aadnk aadnk changed the title Compare with integrating with Python Compare with GPT-4 Advanced Data Analysis (integrating with Python) Oct 26, 2023
@robinsonkwame
Copy link

I'm not sure this issue is helpful given that there some cases where Algorithm of Thoughts may likely be better than OpenAI's current offerings (for now). Instead of finding where AoT is worse it's better to fund searches for where it is better.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants