Team-PIXEL

university

https://github.com/xplip/pixel

Activity Feed Request to join this org

AI & ML interests

Language modelling with pixels

Recent Activity

elliottd authored a paper 12 days ago

Can Community Notes Replace Professional Fact-Checkers?

e-bug authored a paper 3 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

lyan62 authored a paper 8 months ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

View all activity

Team-PIXEL's activity

elliottd

authored a paper 12 days ago

Can Community Notes Replace Professional Fact-Checkers?

Paper • 2502.14132 • Published 18 days ago • 5

e-bug

authored a paper 3 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 129

lyan62

authored 3 papers 8 months ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Paper • 2406.11030 • Published Jun 16, 2024

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4, 2024 • 7

The Role of Data Curation in Image Captioning

Paper • 2305.03610 • Published May 5, 2023

e-bug

authored a paper 8 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

e-bug

authored a paper 11 months ago

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 17

plip

updated a Space 11 months ago

PIXEL

🐱

ilkerkesen

authored a paper about 1 year ago

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

Paper • 2311.07022 • Published Nov 13, 2023 • 1

jflotz

updated 4 datasets about 1 year ago

elliottd

authored a paper over 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 12

plip

authored a paper over 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 12

esalesky

authored a paper over 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 12

jflotz

authored a paper over 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 12

jflotz

updated 2 datasets over 1 year ago

Team-PIXEL/bigrams_wiki-en_529

Viewer • Updated Oct 2, 2023 • 18.4M • 596

Team-PIXEL/bigrams_bookcorpus_529

Viewer • Updated Oct 2, 2023 • 9.81M • 444

e-bug

authored a paper almost 2 years ago

Measuring Progress in Fine-grained Vision-and-Language Understanding

Paper • 2305.07558 • Published May 12, 2023 • 1

AI & ML interests

Recent Activity

Team members 13

Team-PIXEL's activity

PIXEL