Commit History
ignore the fsdp_config section too (#1606) [skip ci]
fff06af
unverified
winglian
commited on
Pass deepspeed and fsdp as None explicitly when merging adapters to allow custom device_map (#1575)
9e1480e
unverified
chiragjn
commited on
ORPO Trainer replacement (#1551)
7d1d22f
unverified
winglian
commited on
Print versions (#1496)
4313b1a
unverified
winglian
commited on
don't use deepspeed or fsdp when merging loras (#1479)
87ca3f9
unverified
winglian
commited on
ORPO (#1419)
2ea70eb
unverified
winglian
commited on
more fixes 20240228 (#1342) [skip ci]
0f985e1
unverified
winglian
commited on
hotfix for capabilities loading (#1331)
7de912e
unverified
winglian
commited on
Pydantic 2.x cfg (#1239)
cc3cebf
unverified
winglian
commited on
add support for https remote yamls (#1277)
9bca7db
unverified
hamel
commited on
Peft deepspeed resume (#1227)
c67fb71
unverified
winglian
commited on
make sure to register the base chatml template even if no system message is provided (#1207)
badda37
unverified
winglian
commited on
Feat/chatml add system message (#1117)
98b4762
unverified
more dpo fixes for dataset loading and docs (#1185) [skip ci]
5bce45f
unverified
winglian
commited on
Fix generation_config validation raises Exception for do_merge_lora (#1184)
02f2c72
unverified
tisorlawan
commited on
Add support for offline mode with HF_HUB_OFFLINE envvar (#1182)
71141de
unverified
don't fail if can't cast weights due to offload when merging (#1172) [skip ci]
fb7f9b9
unverified
winglian
commited on
Add desc to map/filter (#1162)
6840381
unverified
jupyter lab fixes (#1139) [skip ci]
eaaeefc
unverified
winglian
commited on
Preprocess dataset size fix (#1131)
7570446
unverified
winglian
commited on
Reverse caching PR (#1115)
2202a20
unverified
casperhansen
commited on
Disable caching on `--disable_caching` in CLI (#1110)
d66b101
unverified
misc fixes from #943 (#1086) [skip ci]
23495a8
unverified
winglian
commited on
update sharegpt conversations when chatml chat template is set (#1075) [skip ci]
0ce1a65
unverified
winglian
commited on
Add: mlflow for experiment tracking (#1059) [skip ci]
090c24d
unverified
feature: better device mapping for large models (#918)
bdfefaf
unverified
set default for merge (#1044)
63fb3eb
unverified
hamel
commited on
RL/DPO (#935)
f243c21
winglian
commited on
Fix: bf16 support for inference (#981)
3678a6c
unverified
feat: remove need to add load_in* during merge (#1017)
f6ecf14
unverified
Nanobit
commited on
remove landmark attn and xpos rope implementations (#1010)
70b46ca
unverified
winglian
commited on
Fix Deepspeed loading (#950)
5ea3aa3
unverified
winglian
commited on
ensure merged model matches the training dtype (#902)
1d21aa6
unverified
winglian
commited on
include the suffix modified string in ascii art (#852)
614cff4
unverified
fpreiss
commited on
Feat: Added Gradio support (#812)
738a057
unverified
stillerman
commited on
Create preprocess CLI (#785)
e50ab07
unverified
casperhansen
commited on
improve handling of the prepared ds path and other cfg defaults (#701)
1c412c7
unverified
winglian
commited on
Save Axolotl config as WandB artifact (#716)
490923f
unverified
Jan Philipp Harries
commited on
prepared dataset caching, other misc fixes (#665)
e50a64e
unverified
winglian
commited on
Warn users to login to HuggingFace (#645)
85b0be2
unverified
Napuh
commited on
Fix for check with cfg and merge_lora (#600)
62a7741
unverified
winglian
commited on
prevent cli functions from getting fired on import (#581)
8dcd40a
unverified
winglian
commited on