CobraMamba's picture

CobraMamba

CobraMamba

·

https://github.com/chi2liu

633WHU

AI & ML interests

None yet

Organizations

None yet

New activity in ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts 10 months ago

How to Only compress non-shared experts within transformer blocks?

#1 opened 10 months ago by

New activity in CobraMamba/mamba-gpt-7b 10 months ago

Adding `safetensors` variant of this model

#2 opened over 1 year ago by

New activity in CobraMamba/mamba-gpt-7b-v2 10 months ago

Adding `safetensors` variant of this model

#2 opened over 1 year ago by

New activity in CobraMamba/mamba-gpt-7b-v1 10 months ago

Base Model

#2 opened about 1 year ago by

Adding `safetensors` variant of this model

#3 opened about 1 year ago by

New activity in CobraMamba/mamba-gpt-7b-v1 about 2 years ago

Adding Evaluation Results

#1 opened over 2 years ago by

leaderboard-pr-bot

New activity in CobraMamba/mamba-gpt-7b-v2 about 2 years ago

Adding Evaluation Results

#1 opened over 2 years ago by

leaderboard-pr-bot

New activity in CobraMamba/mamba-gpt-3b-v4 over 2 years ago

"The following is the performance under 0-shot testing, mostly better than acrastt/Marx-3B-V2"

#4 opened over 2 years ago by

Update README.md

#3 opened over 2 years ago by

Model card?

#2 opened over 2 years ago by

Adding `safetensors` variant of this model

#1 opened over 2 years ago by

New activity in CobraMamba/mamba-gpt-3b-v3 over 2 years ago

Prompt template?

#4 opened over 2 years ago by

Adding `safetensors` variant of this model

#3 opened over 2 years ago by

what is the difference between v2 and v3?

#2 opened over 2 years ago by

New activity in CobraMamba/mamba-gpt-3b-v2 over 2 years ago

ValueError: expected sequence of length 35 at dim 1 (got 22)

#3 opened over 2 years ago by

New activity in open-llm-leaderboard/open_llm_leaderboard over 2 years ago

mamba-gpt-3b-v2 is the Best 3B Model! Surpassing dolly-v2-12b

#137 opened over 2 years ago by

New activity in CobraMamba/mamba-gpt-3b-v2 over 2 years ago

Adding `safetensors` variant of this model

#2 opened over 2 years ago by

GGML version

#1 opened over 2 years ago by

New activity in CobraMamba/mamba-gpt-3b over 2 years ago

Adding `safetensors` variant of this model

#1 opened over 2 years ago by

New activity in medalpaca/medalpaca-7b over 2 years ago

does anybody have solution to this issue:TypeError: forward() got an unexpected keyword argument 'token_type_ids'

#3 opened over 2 years ago by