v0.8.6 - support LongLLaMA

@kddubey kddubey released this 22 Nov 19:27
· 44 commits to main since this release

Breaking changes

  • Setting the internal past attribute of the cache to None now raises an error if you try to use the cache again. Use the original model instead.

New features

  • Support LongLLaMA
  • Add a repr for the cached model
  • Don't check logits from Llama CPP

Bug fixes

None