Resources for hybrid preferences research where we learn how to route preference instances for human vs. AI feedback
Lj V. Miranda
ljvmiranda921
AI & ML interests
NLP - multilinguality, data-centric AI
Recent Activity
updated
a collection
about 13 hours ago
OLMoE on-policy for DPO
updated
a collection
about 13 hours ago
OLMoE on-policy for DPO
updated
a collection
about 13 hours ago
OLMoE on-policy for DPO
Organizations
models
26
ljvmiranda921/tl_calamancy_lg
Token Classification
•
Updated
•
8
ljvmiranda921/tl_calamancy_md
Token Classification
•
Updated
•
320
ljvmiranda921/tl_calamancy_trf
Token Classification
•
Updated
•
10
ljvmiranda921/tl_calamancy_md-0.1.0
Token Classification
•
Updated
•
105
ljvmiranda921/tl_gliner_large
Token Classification
•
Updated
•
3
ljvmiranda921/tl_gliner_medium
Token Classification
•
Updated
•
6
ljvmiranda921/tl_gliner_small
Token Classification
•
Updated
•
5
ljvmiranda921/tl_calamancy_lg-0.1.0
Token Classification
•
Updated
•
57
•
1
ljvmiranda921/tl_calamancy_trf-0.1.0
Token Classification
•
Updated
•
15
•
5
ljvmiranda921/zh_lzh_sigtyp_trf
Token Classification
•
Updated
•
9