The Best Side of Mamba Paper
We modified Mamba's internal equations so that they accept inputs from, and blend, two separate data streams. To the best of our knowledge, this is the first attempt to adapt the equations of SSMs to a vision task like style transfer without requiring any other module such as cross-attention or custom normalization.
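To make the idea of a two-stream state space recurrence concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' formulation: the text above does not say which stream drives which term, so this sketch assumes the content sequence is the input written into the state, while the style sequence produces the input-dependent parameters (the step size `delta`, the input matrix `B`, and the output matrix `C`). The function name `two_stream_scan` and the weights `w_B`, `w_C`, and `w_delta` are hypothetical.

```python
import math


def softplus(v):
    # Numerically stable softplus; keeps the discretization step positive.
    return max(v, 0.0) + math.log1p(math.exp(-abs(v)))


def two_stream_scan(content, style, A, w_B, w_C, w_delta):
    """Single-channel selective SSM scan fed by two sequences.

    ASSUMPTION (not from the source): `content` is the input x_t, and
    `style` sets the input-dependent parameters delta_t, B_t and C_t.

    content: list of floats, the signal written into the state
    style:   list of floats, same length as `content`
    A:       list of negative floats, the diagonal state matrix (size N)
    w_B:     list of floats (size N), hypothetical projection for B_t
    w_C:     list of floats (size N), hypothetical projection for C_t
    w_delta: float, hypothetical projection for the step size
    """
    if len(content) != len(style):
        raise ValueError("both streams must have the same length")

    h = [0.0] * len(A)  # hidden state, one entry per state dimension
    outputs = []
    for x_t, s_t in zip(content, style):
        # Parameters come from the style stream only.
        delta = softplus(w_delta * s_t)
        B_t = [w * s_t for w in w_B]
        C_t = [w * s_t for w in w_C]

        # Discretized update: h_t = exp(delta * A) * h_{t-1} + delta * B_t * x_t
        h = [
            math.exp(delta * a) * h_n + delta * b * x_t
            for a, h_n, b in zip(A, h, B_t)
        ]

        # Readout: y_t = C_t . h_t
        outputs.append(sum(c * h_n for c, h_n in zip(C_t, h)))
    return outputs


content = [1.0, 0.5, -0.2]
style = [0.3, 0.8, -0.5]
y = two_stream_scan(
    content, style, A=[-1.0, -0.5], w_B=[1.0, 0.5], w_C=[0.7, -0.2], w_delta=1.0
)
```

In this sketch the two streams are mixed inside the recurrence itself, so no separate fusion module is involved. Swapping the roles of the streams, or deriving only some of the parameters from the style sequence, would be equally valid variations of the same idea.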