In basic chatbots, errors are annoyances. In medical LLMs, errors can have life-threatening consequences 🩸
It's therefore vital to benchmark and track advances in medical LLMs before even thinking about deployment.
This is why a small research team introduced a medical LLM leaderboard: it produces reproducible, comparable results across LLMs and lets everyone follow advances in the field.
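Reproducible comparisons like this usually come down to scoring every model on the same fixed multiple-choice medical QA benchmarks. As a minimal sketch of that idea (not the leaderboard's actual pipeline), here's how one could score a model on a multiple-choice question by comparing the log-likelihood it assigns to each answer option; the model name, question, and options below are all placeholders:

```python
# Minimal sketch of multiple-choice QA scoring via log-likelihoods.
# Assumptions: any causal LM from the Hugging Face Hub works here; the
# question and options are placeholders, not real leaderboard data.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; swap in the medical LLM you want to test
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

def option_loglikelihood(question: str, option: str) -> float:
    """Sum of log-probs the model assigns to the option tokens given the question."""
    prompt_ids = tokenizer(question, return_tensors="pt").input_ids
    option_ids = tokenizer(" " + option, return_tensors="pt").input_ids
    input_ids = torch.cat([prompt_ids, option_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits
    # Log-probs for each token, predicted from the preceding position.
    log_probs = logits[:, :-1].log_softmax(-1)
    targets = input_ids[:, 1:]
    token_lps = log_probs.gather(2, targets.unsqueeze(-1)).squeeze(-1)
    # Keep only the log-probs of the option's own tokens.
    return token_lps[0, -option_ids.shape[1]:].sum().item()

question = "Q: Which vitamin deficiency causes scurvy?\nA:"
options = ["Vitamin A", "Vitamin B12", "Vitamin C", "Vitamin D"]
scores = {o: option_loglikelihood(question, o) for o in options}
print(max(scores, key=scores.get))  # the model's pick
```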
Contamination-free code evaluations with LiveCodeBench! 🖥️
LiveCodeBench is a new leaderboard, which contains:
- complete code evaluations (code generation, self-repair, code execution, tests)
- my favorite feature: problem selection by publication date 📅
This feature means you can average model scores over only the problems published after a given model's training cutoff, so they can't have appeared in its training data. In other words... contamination-free code evals! 🚀
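Here's a minimal sketch of that filtering idea (the problem records and cutoff date are hypothetical, and LiveCodeBench's real pipeline differs, but the principle is the same):

```python
# Minimal sketch of date-filtered scoring for contamination-free evals.
from datetime import date

# Each record: (publication date, did the model solve it?) -- hypothetical data.
results = [
    (date(2023, 6, 1), True),
    (date(2023, 11, 20), False),
    (date(2024, 2, 14), True),
    (date(2024, 4, 2), True),
]

def pass_rate_after(cutoff: date, results) -> float:
    """Average solve rate over problems published strictly after `cutoff`."""
    fresh = [solved for published, solved in results if published > cutoff]
    return sum(fresh) / len(fresh) if fresh else float("nan")

# Only problems released after the model's training cutoff count.
training_cutoff = date(2023, 9, 1)  # hypothetical cutoff for the model
print(f"contamination-free pass rate: {pass_rate_after(training_cutoff, results):.2f}")
```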
The new RL leaderboard evaluates agents in 87 environments (from Atari 🎮 to motion-control simulations 🚶 and more)!
When you submit your model, it's run and evaluated in real time, and the leaderboard displays short videos of the best models' runs, which is super fun to watch! ✨
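Under the hood, an evaluation like this boils down to rolling out the submitted agent in an environment and recording episode returns (and, for the videos, rendered frames). Here's a minimal sketch with Gymnasium, using a random policy as a stand-in for a submitted agent; the environment and episode count are arbitrary examples, not the leaderboard's actual setup:

```python
# Minimal sketch of an agent evaluation rollout with Gymnasium.
import gymnasium as gym

env = gym.make("CartPole-v1")  # swap in Atari, motion-control envs, etc.

def evaluate(env, n_episodes: int = 10) -> float:
    """Average undiscounted return over n_episodes with a random policy."""
    returns = []
    for episode in range(n_episodes):
        obs, info = env.reset(seed=episode)
        done, total = False, 0.0
        while not done:
            action = env.action_space.sample()  # replace with agent.act(obs)
            obs, reward, terminated, truncated, info = env.step(action)
            total += reward
            done = terminated or truncated
        returns.append(total)
    return sum(returns) / len(returns)

print(f"mean return: {evaluate(env):.1f}")
env.close()
```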
Kudos to @qgallouedec for creating and maintaining the leaderboard! Let's find out which agent is the best at games! 🚀