Anton Kratz

akratz

AI & ML interests

None yet

Recent Activity

Organizations

None yet

akratz's activity

upvoted an article about 1 month ago
view article
Article

Merge Large Language Models with mergekit

By mlabonne •
• 101
commented on Merge Large Language Models with mergekit about 1 month ago
view reply

Awesome article. It seems to me that only models with identical architecture (e.g., same number of layers, hidden dimensions, attention heads) can be merged with this approach. Is that correct? How do you know which models have identical architectures?

New activity in ctheodoris/Geneformer almost 2 years ago
New activity in ctheodoris/Genecorpus-30M almost 2 years ago

nonzero median

1
#1 opened almost 2 years ago by
akratz
New activity in bigscience/bloom-book almost 2 years ago

🚩 Report : Not working

2
#9 opened almost 2 years ago by
TempoNaoTenho
New activity in bigscience/bloom almost 2 years ago