Skip to content

Commit

Permalink
Add mamba-minimal
Browse files Browse the repository at this point in the history
- Add stateless forward impl.
  - Efficient for training (but training is not yet implemented).
  - Input requires the entire sequence, and requires no state cache.
  - Generates one output for each input sequence.
- Add stateful forward impl.
  - Efficient for inference.
  - Input requires the last single sequence point, and requires the last state cache.
  - Generates a single output referring to the last input.
  • Loading branch information
swfsql committed Feb 9, 2024
1 parent 8cc2c9e commit ce6d624
Show file tree
Hide file tree
Showing 2 changed files with 1,108 additions and 0 deletions.
Loading

0 comments on commit ce6d624

Please sign in to comment.