Commit History

Initialize logging in each script
c4f250e

Joshua Lochner commited on

Do not allow predictions to miss start of video
aa018be

Joshua Lochner commited on

Fix `--no_cuda` argument for preprocessing
87b2dec

Joshua Lochner commited on

Revert model input size back to 512 tokens
721bf64

Joshua Lochner commited on

Fix conflicting `--no_cuda` argument
09cabec

Joshua Lochner commited on

Use correct logger per script
e3d3d3f

Joshua Lochner commited on

Update preprocessing script to use logging module
cfbd4d5

Joshua Lochner commited on

Add `no_cuda` argument to not use GPU
de9c8c4

Joshua Lochner commited on

Update README to include installation instructions
776c8b2

Joshua Lochner commited on

Fix button colour on dark theme
921fb1d

Joshua Lochner commited on

Remove redundant calls to change device
8981122

Joshua Lochner commited on

Add `output_as_json` argument for inference
52340fc

Joshua Lochner commited on

Adjust tokenizer input size based on model input size
9604abd

Joshua Lochner commited on

Fix typo in prediction command
39f6f81

Joshua Lochner commited on

Add transcript option to streamlit app and visual improvements
8a55e13

Joshua Lochner commited on

Show message if predictions returned, but all ignored due to filters/settings
8326048

Joshua Lochner commited on

Update README.md
bfb080b

Joshua Lochner commited on

Remove unused utilities
0e18e8c

Joshua Lochner commited on

Move `load_datasets` to train script
086ca93

Joshua Lochner commited on

Improve how transcripts are stored and how manual transcripts are segmented
583f4cf

Joshua Lochner commited on

Add boilerplate code to detect whether segment was split due to length
df35612

Joshua Lochner commited on

Revert evaluation script to use `processed_file` by default
8fc746d

Joshua Lochner commited on

Fix segmentation using binary search
de9c264

Joshua Lochner commited on

Add fallback for old transcript version
c445f1a

Joshua Lochner commited on

Fix `num_tokens` key in words
83dc695

Joshua Lochner commited on

Optimize segment generation and extraction
4b4c9f0

Joshua Lochner commited on

Abstract inference code
8b71088

Joshua Lochner commited on

Remove duplicated methods from streamlit app
a9123fa

Joshua Lochner commited on

Improve caching and downloading of classifier for predictions
fb87012

Joshua Lochner commited on

Create `ClassifierLoadError`
02e576a

Joshua Lochner commited on

Download classifier and vectorizer if not present
d7a594b

Joshua Lochner commited on

Raise ModelLoadError if model does not exist
dffef09

Joshua Lochner commited on

Update errors
0b7cd5a

Joshua Lochner commited on

Assign default model for predictions
8f0e2d8

Joshua Lochner commited on

Create LICENSE
c78435b

Joshua Lochner commited on

Update README.md
0e3177b

Joshua Lochner commited on

Create FUNDING.yml
4f980b5
unverified

Joshua Lochner commited on

Add `do_process_database` option to preprocessing script
d7b6d7f

Joshua Lochner commited on

Use `get_model_tokenizer` method from streamlit app
9a5d9ed

Joshua Lochner commited on

Hide previous output on run
e926596

Joshua Lochner commited on

Improve exceptions thrown while obtaining transcripts
0b48a99

Joshua Lochner commited on

Fix import error
dbf7b4c

Joshua Lochner commited on

Remove unused imports
bdfb4b1

Joshua Lochner commited on

Fix YouTube ID regex
df05196

Joshua Lochner commited on

Add support for entering YouTube URL into textbox
94ad7ba

Joshua Lochner commited on

Use custom caching system for loading models
c415610

Joshua Lochner commited on

Update model repo ids
85661b3

Joshua Lochner commited on

Add `--channel_id` parameter to evaluation script to run evaluation on a channel
537f2b7

Joshua Lochner commited on

Fix the reduction of overlapping segments
2782b0c

Joshua Lochner commited on

Output auto-submission link for missing segments
183ba5e

Joshua Lochner commited on