CobraMamba
CobraMamba
AI & ML interests
None yet
Organizations
None yet
How to Only compress non-shared experts within transformer blocks?
1
#1 opened 10 months ago
by
CobraMamba
Adding `safetensors` variant of this model
#2 opened over 1 year ago
by
SFconvertbot
Adding `safetensors` variant of this model
#2 opened over 1 year ago
by
SFconvertbot
Base Model
1
#2 opened about 1 year ago
by
Shameless111
Adding `safetensors` variant of this model
#3 opened about 1 year ago
by
SFconvertbot
Adding Evaluation Results
#1 opened over 2 years ago
by
leaderboard-pr-bot
Adding Evaluation Results
#1 opened over 2 years ago
by
leaderboard-pr-bot
"The following is the performance under 0-shot testing, mostly better than acrastt/Marx-3B-V2"
1
#4 opened over 2 years ago
by
acrastt
Update README.md
1
#3 opened over 2 years ago
by
acrastt
Model card?
1
#2 opened over 2 years ago
by
acrastt
Adding `safetensors` variant of this model
#1 opened over 2 years ago
by
SFconvertbot
Prompt template?
1
#4 opened over 2 years ago
by
Samdeman123124
Adding `safetensors` variant of this model
#3 opened over 2 years ago
by
SFconvertbot
what is the difference between v2 and v3?
1
#2 opened over 2 years ago
by
CUIGuy
ValueError: expected sequence of length 35 at dim 1 (got 22)
1
#3 opened over 2 years ago
by
CUIGuy
mamba-gpt-3b-v2 is the Best 3B Model! Surpassing dolly-v2-12b
👍
2
1
#137 opened over 2 years ago
by
CobraMamba
Adding `safetensors` variant of this model
#2 opened over 2 years ago
by
SFconvertbot
GGML version
👍
1
1
#1 opened over 2 years ago
by
s3nh
Adding `safetensors` variant of this model
#1 opened over 2 years ago
by
SFconvertbot
does anybody have solution to this issue:TypeError: forward() got an unexpected keyword argument 'token_type_ids'
5
#3 opened over 2 years ago
by
warfaisal