Commit History
Fix(model loading): Warn when model revision is passed to gptq (#364)
96bd6ae
unverified
Nanobit
commited on
Feat: Add rope scaling (#343)
b521206
unverified
Nanobit
commited on
Merge pull request #356 from tmm1/load_model-args
11ddccb
unverified
tmm1
commited on
simplify load_model signature
7181022
tmm1
commited on
log GPU memory usage
e303d64
tmm1
commited on
ensure enable_input_require_grads is called on model before getting the peft model (#345)
176b888
unverified
winglian
commited on
experimental llama 2 chat support (#296)
3392270
unverified
Jan Philipp Harries
Jan Philipp Harries
commited on
optimize the iteration when tokenizeing large datasets (#332)
fe28543
unverified
winglian
commited on
fix typo
2eda9e0
tmm1
commited on
scope flash-attn+qlora fix correctly, scope to llama, add comment
78b9efb
tmm1
commited on
move flash-attn monkey patch alongside the others
312a9fa
tmm1
commited on
ensure flash-attn fixes happen in both adapter/lora modes, and use torch_dtype
248bf90
tmm1
commited on
qlora w flash attention fixes (#333)
77085ea
unverified
winglian
commited on
add peft install back since it doesn't get installed by setup.py (#331)
db2a358
unverified
winglian
commited on
don't resize embeddings to multiples of 32x by default
1066751
winglian
commited on
fix axolotl training args dataclass annotation
ebaec3c
winglian
commited on
Merge pull request #276 from theobjectivedad/logging_enhancement
6f16c45
unverified
winglian
commited on
Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var
b1f4f7a
theobjectivedad
commited on
Merge branch 'OpenAccess-AI-Collective:main' into logging_enhancement
83237b8
unverified
The Objective Dad
commited on
Add ability to pass 'name' argument to load_dataset
88089e8
chargoddard
commited on
Merge pull request #274 from OpenAccess-AI-Collective/NanoCode012-patch-2
168a7a0
unverified
Nanobit
commited on
Adding logging enhancement
553a86b
theobjectivedad
commited on
Feat: Add save_safetensors
5491278
Nanobit
commited on
Set push to hub as private by default
1514739
unverified
Nanobit
commited on
support for loading a model by git revision
69a2350
winglian
commited on
Merge branch 'main' into quadratic-warmup
c4cf567
unverified
winglian
commited on
better configuration for quadratic warmup
c49729d
winglian
commited on
params are adam_*, not adamw_*
19cf0bd
winglian
commited on
skip explicit model type too if using trust_remote_code
d69da99
winglian
commited on
don't use llama if trust_remote_code is set since that needs to use AutoModel path
66afb76
winglian
commited on
Merge pull request #221 from utensil/local_dataset
b9b7d4c
unverified
winglian
commited on
Fix future deprecation push_to_hub_model_id
e79c8e6
Nanobit
commited on
Merge pull request #224 from OpenAccess-AI-Collective/system-prompt-data
f150c02
unverified
winglian
commited on
push intermediate model checkpoints to hub
612aabd
winglian
commited on
add tests and supoort for loader for sys prompt data
3a38271
winglian
commited on
optionally define whether to use_fast tokenizer
47d601f
winglian
commited on
Support loading data files from a local directory
9bdd30c
utensil
commited on
add validation and tests for adamw hyperparam
cb9d3af
winglian
commited on
support adamw and grad norm hyperparams
6d0ee4b
winglian
commited on
add float16 docs and tweak typehints
88e17ff
winglian
commited on
style correction
136522f
maciej.karasek
commited on
issue #205 bugfix
556fe40
maciej.karasek
commited on
add axolotl trainer and quadratic warmup
7dc580b
winglian
commited on
Merge branch 'main' into flash-optimum
fd2c981
unverified
winglian
commited on
Merge pull request #187 from OpenAccess-AI-Collective/strip-peft-device-map
93dacba
unverified
winglian
commited on
Merge pull request #177 from NanoCode012/fix/landmark-patch
8002ffb
unverified
winglian
commited on
Merge pull request #192 from OpenAccess-AI-Collective/sharegpt-custom-prompt
74ef5cc
unverified
winglian
commited on
Merge branch 'main' into strip-peft-device-map
5e616d9
unverified
winglian
commited on
Merge pull request #159 from AngainorDev/patch-1
8e568bb
unverified
Nanobit
commited on