Commits · Dovakiins/qwerrwe

feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553)

50421c8
unverified

Ram Ram

winglian commited on May 11, 2024

ignore the fsdp_config section too (#1606) [skip ci]

fff06af
unverified

winglian commited on May 9, 2024

Pass deepspeed and fsdp as None explicitly when merging adapters to allow custom device_map (#1575)

9e1480e
unverified

chiragjn commited on May 7, 2024

Gradio configuration parameters (#1591)

3367fca
unverified

marijnfs Marijn Stollenga Marijn Stollenga

winglian commited on May 6, 2024

Add debug option for RL dataset preprocessing (#1404)

cc5d31e
unverified

abhinand

Nanobit commited on Apr 30, 2024

ORPO Trainer replacement (#1551)

7d1d22f
unverified

winglian commited on Apr 19, 2024

Print versions (#1496)

4313b1a
unverified

winglian commited on Apr 9, 2024

don't use deepspeed or fsdp when merging loras (#1479)

87ca3f9
unverified

winglian commited on Apr 5, 2024

ORPO (#1419)

2ea70eb
unverified

winglian commited on Mar 18, 2024

more fixes 20240228 (#1342) [skip ci]

0f985e1
unverified

winglian commited on Feb 28, 2024

hotfix for capabilities loading (#1331)

7de912e
unverified

winglian commited on Feb 26, 2024

Pydantic 2.x cfg (#1239)

cc3cebf
unverified

winglian commited on Feb 26, 2024

add support for https remote yamls (#1277)

9bca7db
unverified

hamel commited on Feb 9, 2024

Peft deepspeed resume (#1227)

c67fb71
unverified

winglian commited on Jan 31, 2024

make sure to register the base chatml template even if no system message is provided (#1207)

badda37
unverified

winglian commited on Jan 25, 2024

Feat/chatml add system message (#1117)

98b4762
unverified

mhenrichsen Mads Henrichsen

winglian commited on Jan 25, 2024

more dpo fixes for dataset loading and docs (#1185) [skip ci]

5bce45f
unverified

winglian commited on Jan 24, 2024

Fix generation_config validation raises Exception for do_merge_lora (#1184)

02f2c72
unverified

tisorlawan commited on Jan 24, 2024

Add support for offline mode with HF_HUB_OFFLINE envvar (#1182)

71141de
unverified

James Wade

winglian commited on Jan 24, 2024

don't fail if can't cast weights due to offload when merging (#1172) [skip ci]

fb7f9b9
unverified

winglian commited on Jan 23, 2024

DPO cleanup (#1126)

7523d1f
unverified

winglian

plaguss HF staff commited on Jan 23, 2024

Add desc to map/filter (#1162)

6840381
unverified

casperhansen

winglian commited on Jan 23, 2024

jupyter lab fixes (#1139) [skip ci]

eaaeefc
unverified

winglian commited on Jan 22, 2024

Preprocess dataset size fix (#1131)

7570446
unverified

winglian commited on Jan 17, 2024

Reverse caching PR (#1115)

2202a20
unverified

casperhansen commited on Jan 13, 2024

Disable caching on `--disable_caching` in CLI (#1110)

d66b101
unverified

casperhansen

winglian commited on Jan 13, 2024

misc fixes from #943 (#1086) [skip ci]

23495a8
unverified

winglian commited on Jan 11, 2024

update sharegpt conversations when chatml chat template is set (#1075) [skip ci]

0ce1a65
unverified

winglian commited on Jan 10, 2024

Add: mlflow for experiment tracking (#1059) [skip ci]

090c24d
unverified

Johan Hansson

winglian commited on Jan 9, 2024

feature: better device mapping for large models (#918)

bdfefaf
unverified

kallewoof Karl-Johan Alm

winglian commited on Jan 5, 2024

set default for merge (#1044)

63fb3eb
unverified

hamel commited on Jan 5, 2024

RL/DPO (#935)

f243c21

winglian commited on Jan 4, 2024

Fix: bf16 support for inference (#981)

3678a6c
unverified

Tazik Shahjahan

winglian commited on Dec 29, 2023

feat: remove need to add load_in* during merge (#1017)

f6ecf14
unverified

Nanobit commited on Dec 29, 2023

remove landmark attn and xpos rope implementations (#1010)

70b46ca
unverified

winglian commited on Dec 28, 2023

Fix Deepspeed loading (#950)

5ea3aa3
unverified

winglian commited on Dec 13, 2023

ensure merged model matches the training dtype (#902)

1d21aa6
unverified

winglian commited on Nov 29, 2023

Determine FSDP/deepspeed settings on device select. (#883)

71b7ea3
unverified

user735 Karl-Johan Alm

winglian commited on Nov 29, 2023

include the suffix modified string in ascii art (#852)

614cff4
unverified

fpreiss commited on Nov 15, 2023

Feat: Added Gradio support (#812)

738a057
unverified

stillerman commited on Nov 5, 2023

Create preprocess CLI (#785)

e50ab07
unverified

casperhansen commited on Oct 26, 2023

improve handling of the prepared ds path and other cfg defaults (#701)

1c412c7
unverified

winglian commited on Oct 13, 2023

Save Axolotl config as WandB artifact (#716)

490923f
unverified

Jan Philipp Harries commited on Oct 11, 2023

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

winglian commited on Oct 3, 2023

Warn users to login to HuggingFace (#645)

85b0be2
unverified

Napuh commited on Sep 27, 2023

Fix for check with cfg and merge_lora (#600)

62a7741
unverified

winglian commited on Sep 19, 2023

prevent cli functions from getting fired on import (#581)

8dcd40a
unverified

winglian commited on Sep 15, 2023

refactor scripts/finetune.py into new cli modules (#550)

861ceca
unverified

winglian

Nanobit commited on Sep 15, 2023

Commit History

feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553) 50421c8 unverified

ignore the fsdp_config section too (#1606) [skip ci] fff06af unverified

Pass deepspeed and fsdp as None explicitly when merging adapters to allow custom device_map (#1575) 9e1480e unverified

Gradio configuration parameters (#1591) 3367fca unverified

Add debug option for RL dataset preprocessing (#1404) cc5d31e unverified

ORPO Trainer replacement (#1551) 7d1d22f unverified

Print versions (#1496) 4313b1a unverified

don't use deepspeed or fsdp when merging loras (#1479) 87ca3f9 unverified

ORPO (#1419) 2ea70eb unverified

more fixes 20240228 (#1342) [skip ci] 0f985e1 unverified

hotfix for capabilities loading (#1331) 7de912e unverified

Pydantic 2.x cfg (#1239) cc3cebf unverified

add support for https remote yamls (#1277) 9bca7db unverified

Peft deepspeed resume (#1227) c67fb71 unverified

make sure to register the base chatml template even if no system message is provided (#1207) badda37 unverified

Feat/chatml add system message (#1117) 98b4762 unverified

more dpo fixes for dataset loading and docs (#1185) [skip ci] 5bce45f unverified

Fix generation_config validation raises Exception for do_merge_lora (#1184) 02f2c72 unverified

Add support for offline mode with HF_HUB_OFFLINE envvar (#1182) 71141de unverified

don't fail if can't cast weights due to offload when merging (#1172) [skip ci] fb7f9b9 unverified

DPO cleanup (#1126) 7523d1f unverified

Add desc to map/filter (#1162) 6840381 unverified

jupyter lab fixes (#1139) [skip ci] eaaeefc unverified

Preprocess dataset size fix (#1131) 7570446 unverified

Reverse caching PR (#1115) 2202a20 unverified

Disable caching on `--disable_caching` in CLI (#1110) d66b101 unverified

misc fixes from #943 (#1086) [skip ci] 23495a8 unverified

update sharegpt conversations when chatml chat template is set (#1075) [skip ci] 0ce1a65 unverified

Add: mlflow for experiment tracking (#1059) [skip ci] 090c24d unverified

feature: better device mapping for large models (#918) bdfefaf unverified

set default for merge (#1044) 63fb3eb unverified

RL/DPO (#935) f243c21

Fix: bf16 support for inference (#981) 3678a6c unverified

feat: remove need to add load_in* during merge (#1017) f6ecf14 unverified

remove landmark attn and xpos rope implementations (#1010) 70b46ca unverified

Fix Deepspeed loading (#950) 5ea3aa3 unverified

ensure merged model matches the training dtype (#902) 1d21aa6 unverified

Determine FSDP/deepspeed settings on device select. (#883) 71b7ea3 unverified

include the suffix modified string in ascii art (#852) 614cff4 unverified

Feat: Added Gradio support (#812) 738a057 unverified

Create preprocess CLI (#785) e50ab07 unverified

improve handling of the prepared ds path and other cfg defaults (#701) 1c412c7 unverified

Save Axolotl config as WandB artifact (#716) 490923f unverified

prepared dataset caching, other misc fixes (#665) e50a64e unverified

Warn users to login to HuggingFace (#645) 85b0be2 unverified

Fix for check with cfg and merge_lora (#600) 62a7741 unverified

prevent cli functions from getting fired on import (#581) 8dcd40a unverified

refactor scripts/finetune.py into new cli modules (#550) 861ceca unverified