	Single Image Deraining#Rain100H#PSNR
	Question Answering#YahooCQA#P@1
	Atari Games#Atari 2600 Private Eye#Score
	Speech Recognition#MediaSpeech#WER for Turkish
	3D Point Cloud Classification#ModelNet40#Mean Accuracy
	Image Clustering#STL-10#Train Split
	Time Series Classification#WalkvsRun#NLL
	language_modeling#Text8#Number of params
	Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Chinese#Accuracy
	Weakly-supervised 3D Human Pose Estimation#Human3.6M#3D Annotations
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Decay)
	Image-to-Image Translation#Cityscapes Labels-to-Photo#FID
	Neural Architecture Search#ImageNet#Accuracy
	Human Pose Forecasting#Human3.6M#MAR, walking, 400ms
	Face Detection#WIDER Face (Medium)#AP
	Incremental Learning#CIFAR-100 - 50 classes + 10 steps of 5 classes#Average Incremental Accuracy
	Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (60% training data)
	Text Simplification#PWKP / WikiSmall#SARI
	Network Pruning#ImageNet#Accuracy
	Line Segment Detection#York Urban Dataset#sAP10
	Visual Dialog#VisDial v0.9 val#R@10
	Link Prediction#WN18RR#MR
	Stereo-LiDAR Fusion#KITTI Depth Completion Validation#RMSE
	Question Answering#WikiHop#Test
	Colorectal Gland Segmentation:#CRAG#Dice
	Image Super-Resolution#Set14 - 4x upscaling#MOS
	Semantic Segmentation#NYU Depth v2#Mean IoU
	Fine-Grained Image Classification#DF20 - Mini#F1 - macro
	Node Classification#Squirrel#Accuracy
	Recommendation Systems#Netflix#Recall@50
	6D Pose Estimation using RGB#LineMOD#Mean ADD
	Unsupervised Machine Translation#WMT2016 German-English#BLEU
	Video Retrieval#LSMDC#text-to-video R@5
	Video Retrieval#LSMDC#text-to-video R@1
	Semantic Segmentation#S3DIS#oAcc
	Recommendation Systems#Netflix#Recall@20
	Image Classification#ImageNet ReaL#Params
	Natural Language Inference#SNLI#Parameters
	Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Precision
	language_modeling#WikiText-2#Validation perplexity
	Lipreading#LRS2#Word Error Rate (WER)
	JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#PSNR
	Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: general purpose
	Few-Shot Image Classification#Mini-ImageNet - 1-Shot Learning#Accuracy
	Image Super-Resolution#Set14 - 3x upscaling#SSIM
	Link Prediction#MovieLens 25M#Hits@10
	Supervised Video Summarization#SumMe#F1-score (Canonical)
	Fine-Grained Image Classification#Oxford 102 Flowers#Accuracy
	Panoptic Segmentation#COCO panoptic#PQ
	summarization#CNN / Daily Mail (Anonymized version)#METEOR
	Link Prediction#Citeseer#AUC
	Action Recognition#EPIC-KITCHENS-100#Action@1
	Face Detection#Annotated Faces in the Wild#AP
	Multimodal Machine Translation#Multi30K#Meteor (EN-DE)
	Image-to-Image Translation#Cityscapes Labels-to-Photo#mIoU
	Image Retrieval#Flickr30K 1K test#R@5
	Image Retrieval#Flickr30K 1K test#R@1
	Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Mean)
	Pedestrian Detection#CityPersons#Heavy MR^-2
	Data-to-Text Generation#E2E NLG Challenge#METEOR
	Atari Games#Atari 2600 Skiing#Score
	Deblurring#RealBlur-R (trained on GoPro)#PSNR (sRGB)
	Semantic Retrieval#Contract Discovery#Soft-F1
	Action Recognition#NTU RGB+D 120#Accuracy (Cross-Subject)
	Language Modelling#WikiText-103#Number of params
	Action Segmentation#50 Salads#F1@25%
	Paraphrase Identification#Quora Question Pairs#Accuracy
	Semi-Supervised Semantic Segmentation#Cityscapes 100 samples labeled#Validation mIoU
	Image Generation#CelebA 64x64#FID
	Time Series Classification#Libras#Accuracy
	Weakly-supervised 3D Human Pose Estimation#Human3.6M#Number of Frames Per View
	Robotic Grasping#Cornell Grasp Dataset#5 fold cross validation
	Referring Expression Segmentation#RefCOCO testB#IoU
	JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#PSNR-B
	Visual Navigation#Cooperative Vision-and-Dialogue Navigation#spl
	Skeleton Based Action Recognition#Kinetics-Skeleton dataset#Accuracy
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Mean)
	3D Human Pose Estimation#3DPW#MPVPE
	Action Recognition#Something-Something V1#Top 5 Accuracy
	language_modeling#Text8#Bit per Character (BPC)
	Image Generation#LSUN Bedroom 256 x 256#FID
	Deblurring#RealBlur-J (trained on GoPro)#SSIM (sRGB)
	Skeleton Based Action Recognition#NTU RGB+D#Accuracy (CS)
	relation_prediction#FB15K-237#H@1
	Video Captioning#YouCook2#METEOR
	Semantic Textual Similarity#STS Benchmark#Pearson Correlation
	Speech Recognition#LibriSpeech test-clean#Word Error Rate (WER)
	Video Retrieval#MSR-VTT#text-to-video R@10
	Knowledge Graph Completion#FB15k-237#Hits@10
	Graph Regression#ZINC 100k#MAE
	Open-Domain Question Answering#SearchQA#Unigram Acc
	Chinese Named Entity Recognition#OntoNotes 4#F1
	Scene Text Detection#Total-Text#F-Measure
	Atari Games#Atari 2600 James Bond#Score
	Time Series Classification#CMUsubject16#NLL
	Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CV I)
	Text-to-Image Generation#Multi-Modal-CelebA-HQ#LPIPS
	Graph Classification#IMDb-M#Accuracy
	Skeleton Based Action Recognition#NTU RGB+D#Accuracy (CV)
	Neural Architecture Search#CIFAR-10 Image Classification#Params
	Nested Mention Recognition#ACE 2004#F1
	JPEG Artifact Correction#LIVE1 (Quality 20 Color)#SSIM
	Entity Linking#WiC-TSV#Task 1 Accuracy: all
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Recall)
	Few-Shot Image Classification#CIFAR-FS 5-way (1-shot)#Accuracy
	Deblurring#RealBlur-R (trained on GoPro)#SSIM (sRGB)
	Action Recognition#Something-Something V2#GFLOPs
	Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Recall)
	Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R2@1
	Music Source Separation#MUSDB18#SDR (bass)
	Language Modelling#Penn Treebank (Word Level)#Params
	Object Detection#PASCAL VOC 2007#MAP
	Common Sense Reasoning#CommonsenseQA#Accuracy
	JPEG Artifact Correction#ICB (Quality 20 Color)#SSIM
	Person Re-Identification#CUHK03 detected#Rank-1
	Image Generation#ImageNet 128x128#FID
	Image Retrieval with Multi-Modal Query#Fashion200k#Recall@1
	Dependency Parsing#Penn Treebank#LAS
	Time Series Classification#AUSLAN#NLL
	Language Modelling#Hutter Prize#Number of params
	Hand Pose Estimation#NYU Hands#Average 3D Error
	Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@5
	dependency_parsing#Penn Treebank#UAS
	Visual Dialog#VisDial v0.9 val#Mean Rank
	Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@1
	Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@2
	Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Decay)
	Person Re-Identification#CUHK03#MAP
	Retinal Vessel Segmentation#CHASE_DB1#F1 score
	Grayscale Image Denoising#Urban100 sigma25#PSNR
	Image-to-Image Translation#Cityscapes Labels-to-Photo#Class IOU
	Action Recognition#Something-Something V2#Parameters
	Question Answering#Natural Questions (short)#F1
	Multivariate Time Series Forecasting#MIMIC-III#NegLL
	Brain Tumor Segmentation#BRATS-2015#Dice Score
	Paraphrase Identification#Quora Question Pairs#F1
	Image Super-Resolution#BSD100 - 3x upscaling#PSNR
	RGB-D Salient Object Detection#STERE#max E-Measure
	language_modeling#Penn Treebank#Validation perplexity
	Click-Through Rate Prediction#Criteo#Log Loss
	Action Recognition#ActivityNet#mAP
	Domain Generalization#ImageNet-R#Top-1 Error Rate
	Domain Adaptation#USPS-to-MNIST#Accuracy
	Atari Games#Atari 2600 Crazy Climber#Score
	Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (80% training data)
	Open-Domain Question Answering#Quasar#EM (Quasar-T)
	Question Answering#bAbi#Mean Error Rate
	Keypoint Detection#COCO test-challenge#AR
	Continuous Control#PyBullet Ant#Return
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#J&F
	Keypoint Detection#COCO test-challenge#AP
	Text Classification#TREC-6#Error
	Text Classification#Yelp-5#Accuracy
	Atari Games#Atari 2600 Ms. Pacman#Score
	Text Classification#AG News#Error
	Named Entity Recognition#SciERC#F1
	Image Classification#Kuzushiji-MNIST#Accuracy
	Action Recognition#HACS#Top 5 Accuracy
	Few-Shot Image Classification#Stanford Cars 5-way (5-shot)#Accuracy
	Time Series Classification#CharacterTrajectories#Accuracy
	Coreference Resolution#CoNLL 2012#Avg F1
	JPEG Artifact Correction#Classic5 (Quality 10 Grayscale)#PSNR
	Sentiment Analysis#Multi-Domain Sentiment Dataset#DVD
	Text based Person Retrieval#CUHK-PEDES#R@1
	Multi-Person Pose Estimation#COCO#Validation AP
	Text based Person Retrieval#CUHK-PEDES#R@5
	Language Modelling#WikiText-103#Validation perplexity
	Image-to-Image Translation#ADE20K Labels-to-Photos#Accuracy
	Recommendation Systems#Million Song Dataset#nDCG@100
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Recall)
	Instance Segmentation#COCO test-dev#mask AP
	Extractive Text Summarization#CNN / Daily Mail#ROUGE-1
	Action Classification#Kinetics-600#Top-5 Accuracy
	Text-to-Image Generation#Multi-Modal-CelebA-HQ#Real
	Action Segmentation#GTEA#Acc
	Self-Supervised Action Recognition#UCF101#3-fold Accuracy
	Extractive Text Summarization#CNN / Daily Mail#ROUGE-2
	3D Object Detection#KITTI Cyclists Easy#AP
	Image Generation#STL-10#Inception score
	Extractive Text Summarization#CNN / Daily Mail#ROUGE-L
	Visual Dialog#VisDial v0.9 val#R@5
	Visual Dialog#VisDial v0.9 val#R@1
	JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#SSIM
	Text Summarization#DUC 2004 Task 1#ROUGE-1
	Text Summarization#DUC 2004 Task 1#ROUGE-2
	Grayscale Image Denoising#Urban100 sigma15#PSNR
	Dense Pixel Correspondence Estimation#HPatches#Viewpoint III AEPE
	3D Part Segmentation#ShapeNet-Part#Class Average IoU
	Text Summarization#DUC 2004 Task 1#ROUGE-L
	Gesture-to-Gesture Translation#NTU Hand Digit#AMT
	RGB-D Salient Object Detection#SIP#Average MAE
	Nested Named Entity Recognition#ACE 2005#F1
	Grayscale Image Denoising#BSD68 sigma25#PSNR
	Question Answering#FQuAD#F1
	Question Answering#FQuAD#EM
	Atari Games#Atari 2600 Pong#Score
	Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (AV II)
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#MS-SSIM
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Mean)
	Photo geolocation estimation#Im2GPS#Region level (200 km)
	Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CS)
	Single Image Deraining#Test1200#SSIM
	Chinese Named Entity Recognition#MSRA#F1
	Text-to-Image Generation#Multi-Modal-CelebA-HQ#FID
	Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Accuracy (val)
	Depth Completion#KITTI Depth Completion#MAE
	Few-Shot Image Classification#Mini-Imagenet 20-way (5-shot)#Accuracy
	Person Re-Identification#Market-1501#MAP
	Recommendation Systems#MovieLens 10M#RMSE
	Action Classification#Kinetics-400#Vid acc@1
	Semantic Segmentation#S3DIS Area5#mIoU
	Action Classification#Kinetics-400#Vid acc@5
	Image Super-Resolution#Set14 - 8x upscaling#SSIM
	Anomaly Detection#One-class CIFAR-10#AUROC
	Image Retrieval#CUB-200-2011#R@1
	Node Classification#Cora#Validation
	Time Series Classification#DigitShapes#NLL
	Image Generation#CelebA-HQ 128x128#FID
	Atari Games#Atari 2600 Breakout#Score
	Action Segmentation#50 Salads#Acc
	Self-Supervised Action Recognition#HMDB51 (finetuned)#Top-1 Accuracy
	Emotion Recognition in Conversation#EmoryNLP#Weighted Macro-F1
	Language Modelling#enwik8#Number of params
	Node Classification#Brazil Air-Traffic#Accuracy
	Music Source Separation#MUSDB18#SDR (other)
	Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Recall)
	Person Search#PRW#mAP
	Sentiment Analysis#Amazon Review Polarity#Accuracy
	Deblurring#GoPro#PSNR
	Named Entity Recognition#JNLPBA#F1
	Object Detection#CrowdHuman (full body)#mMR
	Question Answering#CoQA#In-domain
	Action Segmentation#50 Salads#F1@50%
	Panoptic Segmentation#Cityscapes val#AP
	Image-to-Image Translation#SYNTHIA-to-Cityscapes#mIoU (13 classes)
	Keypoint Detection#COCO#Test AP
	Photo geolocation estimation#Im2GPS#City level (25 km)
	Fine-Grained Image Classification#Stanford Cars#Accuracy
	Trajectory Prediction#ETH/UCY#ADE-8/12
	question_answering#SearchQA#N-gram F1
	Single Image Deraining#Test2800#SSIM
	Breast Tumour Classification#PCam#AUC
	Real-Time Semantic Segmentation#Cityscapes test#Frame (fps)
	Person Re-Identification#MSMT17#Rank-1
	JPEG Artifact Correction#ICB (Quality 10 Color)#PSNR
	Unsupervised MNIST#MNIST#Accuracy
	Vision and Language Navigation#VLN Challenge#success
	3D Object Detection#KITTI Cars Moderate#AP
	Sentiment Analysis#TweetEval#Emoji
	Object Detection#iSAID#Average Precision
	language_modeling#WikiText-2#Test perplexity
	Image Super-Resolution#Urban100 - 3x upscaling#PSNR
	Panoptic Segmentation#COCO test-dev#PQ
	3D Instance Segmentation#S3DIS#mPrec
	Atari Games#Atari-57#Medium Human-Normalized Score
	Image Classification#Tiered ImageNet 5-way (5-shot)#Accuracy
	Multi-Person Pose Estimation#MPII Multi-Person#AP
	Atari Games#Atari 2600 Asteroids#Score
	Instance Segmentation#COCO test-dev#AP75
	Action Classification#AViD#Accuracy
	Face Alignment#WFLW#ME (%, all)
	Monocular 3D Human Pose Estimation#Human3.6M#Need Ground Truth 2D Pose
	Denoising#Darmstadt Noise Dataset#PSNR
	Atari Games#Atari 2600 Assault#Score
	Atari Games#Atari 2600 Time Pilot#Score
	Hand Pose Estimation#ICVL Hands#Average 3D Error
	Atari Games#Atari 2600 Robotank#Score
	Pose Estimation#COCO test-dev#APL
	Pose Estimation#COCO test-dev#APM
	Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.95
	Node Classification#Reddit#Accuracy
	Face Verification#IJB-A#TAR @ FAR=0.01
	Pose Transfer#Deep-Fashion#IS
	Atari Games#Atari 2600 Gopher#Score
	Natural Language Inference#WNLI#Accuracy
	Visual Question Answering#GQA Test2019#Binary
	Hand Pose Estimation#MSRA Hands#Average 3D Error
	Heterogeneous Node Classification#DBLP (PACT) 14k#Micro-F1 (80% training data)
	Image Matting#Composition-1K#MSE
	named_entity_recognition#CoNLL 2003 (English)#F1
	Node Classification#Europe Air-Traffic#Accuracy
	Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.75
	Atari Games#Atari 2600 Montezuma's Revenge#Score
	Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Decay)
	Real-Time Semantic Segmentation#CamVid#mIoU
	Semantic Segmentation#CamVid#Mean IoU
	Instance Segmentation#COCO test-dev#AP50
	Question Answering#OpenBookQA#Accuracy
	Speech Recognition#LibriSpeech test-other#Word Error Rate (WER)
	Link Prediction#WN18RR#Hits@3
	Panoptic Segmentation#Cityscapes val#PQ
	Link Prediction#WN18RR#Hits@1
	Click-Through Rate Prediction#Company*#Log Loss
	Video Retrieval#MSR-VTT#text-to-video Median Rank
	Nested Named Entity Recognition#ACE 2004#F1
	Color Image Denoising#Darmstadt Noise Dataset#PSNR (sRGB)
	Deblurring#HIDE (trained on GOPRO)#PSNR (sRGB)
	Image Generation#FFHQ#FID
	Video Captioning#YouCook2#CIDEr
	Session-Based Recommendations#Diginetica#MRR@20
	Optical Flow Estimation#Sintel-final#Average End-Point Error
	Skeleton Based Action Recognition#J-HMDB#Accuracy (RGB+pose)
	Action Classification#Kinetics-400#Clip acc@5
	Action Classification#Kinetics-400#Clip acc@1
	RGB-D Salient Object Detection#NLPR#max E-Measure
	3D Object Detection#KITTI Cyclists Hard#AP
	Multi-Frame Super-Resolution#PROBA-V#Normalized cPSNR
	Recommendation Systems#Flixster Monti#RMSE
	Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Mean)
	Image-to-Image Translation#COCO-Stuff Labels-to-Photos#Accuracy
	Visual Question Answering#CLEVR#Accuracy
	Egocentric Activity Recognition#EPIC-KITCHENS-55#Actions Top-1 (S2)
	Self-Supervised Image Classification#ImageNet#Top 1 Accuracy
	Click-Through Rate Prediction#Avazu#AUC
	Few-Shot Image Classification#Meta-Dataset Rank#Mean Rank
	Natural Language Inference#RTE#Accuracy
	Time Series Classification#ECG#NLL
	Image Relighting#VIDIT’20 validation set#Runtime(s)
	Domain Adaptation#Office-Home#Accuracy
	Click-Through Rate Prediction#Bing News#AUC
	Domain Generalization#PACS#Average Accuracy
	Image Super-Resolution#Set5 - 3x upscaling#PSNR
	Multivariate Time Series Imputation#MuJoCo#MSE (10^2, 50% missing)
	Color Image Denoising#Darmstadt Noise Dataset#SSIM (sRGB)
	Scene Text Detection#ICDAR 2017 MLT#F-Measure
	Image Clustering#STL-10#Accuracy
	Few-Shot Image Classification#Tiered ImageNet 5-way (5-shot)#Accuracy
	Emotion Recognition in Conversation#EC#Micro-F1
	Video Alignment#UPenn Action#Kendall's Tau
	Weakly Supervised Action Localization#ActivityNet-1.2#mAP@0.5
	Keypoint Detection#MPII Multi-Person#mAP@0.5
	Video Captioning#YouCook2#ROUGE-L
	Link Prediction#WordNet#Accuracy
	Image Classification#CIFAR-10#Percentage correct
	Single Image Deraining#Test100#SSIM
	Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#IoU
	Reading Comprehension#RACE#Accuracy (High)
	Object Detection#CrowdHuman (full body)#AP
	Text-to-Image Generation#COCO#FID
	Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#PSNR
	Anomaly Detection#MVTec AD#Detection AUROC
	Node Classification#Pubmed Full-supervised#Accuracy
	Referring Expression Segmentation#RefCoCo val#IoU
	Birds Eye View Object Detection#KITTI Cyclists Moderate#AP
	Hand Pose Estimation#HANDS 2017#Average 3D Error
	Grammatical Error Detection#CoNLL-2014 A2#F0.5
	Image Super-Resolution#Set14 - 4x upscaling#SSIM
	Continuous Control#PyBullet Hopper#Return
	Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Subtask 1+2#F1
	constituency_parsing#Penn Treebank#F1
	Image Relighting#VIDIT’20 validation set#SSIM
	Object Counting#CARPK#MAE
	Atari Games#Atari 2600 Beam Rider#Score
	Metric Learning#CUB-200-2011#R@1
	Image Generation#LSUN Bedroom 256 x 256#FID-10k-training-steps
	language_modeling#Hutter Prize#Bit per Character (BPC)
	Fact-based Text Editing#WebEdit#Exact Match
	Few-Shot Image Classification#CUB 200 5-way 5-shot#Accuracy
	Video Retrieval#MSVD#text-to-video Median Rank
	Visual Navigation#Cooperative Vision-and-Dialogue Navigation#dist_to_end_reduction
	Domain Adaptation#ImageCLEF-DA#Accuracy
	Fine-Grained Image Classification#DF20 - Mini#Top-1
	Fine-Grained Image Classification#DF20 - Mini#Top-3
	Part-Of-Speech Tagging#Penn Treebank#Accuracy
	Action Spotting#SoccerNet#Average-mAP
	Semi-Supervised Video Object Segmentation#YouTube-VOS#Jaccard (Unseen)
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#J&F
	Face Detection#PASCAL Face#AP
	Atari Games#Atari 2600 Pitfall!#Score
	Image Super-Resolution#Set5 - 4x upscaling#MOS
	Human Pose Forecasting#Human3.6M#MAR, walking, 1,000ms
	Image Clustering#Extended Yale-B#NMI
	Person Re-Identification#DukeMTMC-reID#Rank-10
	Click-Through Rate Prediction#Company*#AUC
	Link Prediction#YAGO3-10#MRR
	Image-to-Image Translation#ADE20K Labels-to-Photos#mIoU
	Text Simplification#ASSET#SARI (EASSE>=0.2.1)
	word_segmentation#PKU#F1
	Dense Pixel Correspondence Estimation#HPatches#Viewpoint IV AEPE
	Human-Object Interaction Detection#HICO-DET#mAP
	Constituency Grammar Induction#PTB#Mean F1 (WSJ)
	Spoken language identification#LRE07#Average
	word_sense_disambiguation#Senseval 2#F1
	Node Classification#Cora Full-supervised#Accuracy
	RGB Salient Object Detection#DUTS-TE#F-measure
	Video Captioning#YouCook2#BLEU-4
	Atari Games#Atari 2600 Zaxxon#Score
	Image Classification#CINIC-10#Accuracy
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#NIQE
	Image Classification#WebVision-1000#Top-5 Accuracy
	Time Series Classification#UWave#NLL
	Data-to-Text Generation#E2E NLG Challenge#NIST
	Semantic Segmentation#S3DIS Area5#oAcc
	Monocular Depth Estimation#KITTI Eigen split unsupervised#absolute relative error
	Reading Comprehension#ReClor#Test
	Anomaly Detection#MVTec AD#Segmentation AUROC
	Deblurring#HIDE (trained on GOPRO)#SSIM (sRGB)
	Link Prediction#OpenBioLink#Hits@1
	Text Classification#IMDb#Accuracy (10 classes)
	Link Prediction#OpenBioLink#Hits@3
	Pose Tracking#PoseTrack2017#mAP
	Node Classification#Cora with Public Split: fixed 20 nodes per class#Accuracy
	sentiment_analysis#SemEval-2014 Task 4 subtask 2 Aspect Term Polarity#Restaurant (acc)
	Text-to-Image Generation#COCO#Inception score
	Causal Inference#IDHP#Average Treatment Effect Error
	3D Part Segmentation#ShapeNet-Part#Instance Average IoU
	Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (20% training data)
	Face Detection#FDDB#AP
	Fine-Grained Image Classification#Oxford 102 Flowers#PARAMS
	Natural Language Inference#MultiNLI#Mismatched
	Curved Text Detection#SCUT-CTW1500#F-Measure
	Photo geolocation estimation#Im2GPS#Street level (1 km)
	Keypoint Detection#COCO#Validation AP
	Fake News Detection#FNC-1#Per-class Accuracy (Discuss)
	Cross-Modal Retrieval#Flickr30k#Text-to-image R@5
	Cross-Modal Retrieval#Flickr30k#Text-to-image R@1
	Domain Adaptation#SYNTHIA-to-Cityscapes#mIoU
	Image Generation#LSUN Churches 256 x 256#FID
	Visual Object Tracking#TrackingNet#Normalized Precision
	JPEG Artifact Correction#LIVE1 (Quality 10 Color)#PSNR-B
	AMR Parsing#LDC2017T10#Smatch
	Time Series Classification#Shapes#NLL
	Machine Translation#WMT2016 Romanian-English#BLEU score
	Ad-Hoc Information Retrieval#TREC Robust04#P@20
	Named Entity Recognition#CoNLL 2003 (English)#F1
	Time Series Classification#PenDigits#Accuracy
	JPEG Artifact Correction#LIVE1 (Quality 20 Color)#PSNR-B
	Real-Time Semantic Segmentation#Cityscapes test#mIoU
	Monocular 3D Human Pose Estimation#Human3.6M#Frames Needed
	Question Answering#DROP Test#F1
	Few-Shot Image Classification#Mini-Imagenet 10-way (1-shot)#Accuracy
	Action Recognition#HACS#Top 1 Accuracy
	language_modeling#WikiText-103#Validation perplexity
	Intent Detection#ATIS#Accuracy
	Scene Text Detection#SCUT-CTW1500#Recall
	Image Super-Resolution#Set14 - 2x upscaling#SSIM
	Node Classification#CiteSeer (1%)#Accuracy
	3D Human Pose Estimation#Total Capture#Average MPJPE (mm)
	Automated Theorem Proving#HolStep (Conditional)#Classification Accuracy
	Audio Classification#AudioSet#Test mAP
	Fact-based Text Editing#WebEdit#SARI
	Natural Language Inference#QNLI#Accuracy
	Document Image Classification#RVL-CDIP#Accuracy
	Natural Language Inference#ANLI test#A2
	Natural Language Inference#ANLI test#A1
	Natural Language Inference#ANLI test#A3
	Question Answering#Quasart-T#EM
	Image Super-Resolution#Manga109 - 3x upscaling#PSNR
	Word Sense Disambiguation#SemEval 2013 Task 12#F1
	Semantic Textual Similarity#MRPC#F1
	Object Counting#CARPK#RMSE
	Image Matting#Composition-1K#Conn
	Self-Supervised Action Recognition#UCF101 (finetuned)#3-fold Accuracy
	Multimodal Activity Recognition#Moments in Time Dataset#Top-1 (%)
	3D Semantic Instance Segmentation#ScanNetV2#mAP@0.50
	Video Super-Resolution#Vid4 - 4x upscaling#PSNR
	relation_prediction#WN18RR#H@1
	Cross-View Image-to-Image Translation#Dayton (256×256) - aerial-to-ground#SSIM
	Language Modelling#enwik8#Bit per Character (BPC)
	Hyperspectral Image Classification#Indian Pines#Overall Accuracy
	Language Modelling#One Billion Word#PPL
	Chinese Named Entity Recognition#Weibo NER#F1
	RGB-D Salient Object Detection#SIP#max E-Measure
	Question Answering#SQuAD1.1#F1
	Question Answering#SQuAD1.1#EM
	Question Answering#NarrativeQA#Rouge-L
	Person Re-Identification#PRID2011#Rank-5
	Person Re-Identification#PRID2011#Rank-1
	Language Modelling#One Billion Word#Number of params
	Image Classification#Clothing1M#Accuracy
	JPEG Artifact Correction#ICB (Quality 20 Grayscale)#PSNR
	Node Classification#BlogCatalog#Macro-F1
	Image Classification#iNaturalist 2018#Top-1 Accuracy
	RGB-D Salient Object Detection#DES#S-Measure
	Fake News Detection#FNC-1#Per-class Accuracy (Unrelated)
	Text Classification#DBpedia#Error
	Word Sense Disambiguation#SensEval 2#F1
	Link Prediction#Pubmed#AUC
	Image Denoising#DND#SSIM (sRGB)
	Video Retrieval#MSR-VTT-1kA#text-to-video Median Rank
	Image Clustering#CIFAR-10#NMI
	Scene Text Detection#ICDAR 2013#Precision
	summarization#Gigaword#ROUGE-1
	Atari Games#Atari 2600 Ice Hockey#Score
	summarization#Gigaword#ROUGE-2
	Entity Linking#WiC-TSV#Task 1 Accuracy: domain specific
	summarization#Gigaword#ROUGE-L
	Image Relighting#VIDIT’20 validation set#PSNR
	Point Cloud Registration#3DMatch Benchmark#Recall
	Machine Translation#IWSLT2015 English-Vietnamese#BLEU
	Lesion Segmentation#ISIC 2018#Dice Score
	Atari Games#Atari 2600 Freeway#Score
	Action Recognition#AVA v2.1#mAP (Val)
	Grayscale Image Denoising#Set12 sigma50#PSNR
	3D Object Detection#nuScenes#NDS
	Dialogue State Tracking#Wizard-of-Oz#Joint
	Sentiment Analysis#Multi-Domain Sentiment Dataset#Books
	Image Clustering#ImageNet-10#Accuracy
	Semantic Segmentation#Semantic3D#mIoU
	Image Clustering#Tiny-ImageNet#NMI
	Image Relighting#VIDIT’20 validation set#MPS
	Object Counting#Pascal VOC 2007 count-test#mRMSE
	JPEG Artifact Correction#ICB (Quality 10 Grayscale)#SSIM
	Crowd Counting#ShanghaiTech B#MAE
	Human-Object Interaction Detection#V-COCO#Time Per Frame(ms)
	Gesture-to-Gesture Translation#Senz3D#AMT
	3D Human Pose Estimation#3D Poses in the Wild Challenge#MPJPE
	Keypoint Detection#COCO test-dev#AR
	Image Retrieval#Par6k#mAP
	Action Recognition#Something-Something V2#Top-1 Accuracy
	Graph Regression#PCQM4M-LSC#Test MAE
	Graph Classification#PTC#Accuracy
	Visual Question Answering#VQA v2 test-dev#Accuracy
	Anomaly Detection#Numenta Anomaly Benchmark#NAB score
	Semantic Segmentation#S3DIS#Mean IoU
	Sentiment Analysis#CR#Accuracy
	Image Classification#CIFAR-10#PARAMS
	Open-Domain Question Answering#SearchQA#EM
	Fine-Grained Image Classification#FGVC Aircraft#Accuracy
	Visual Object Tracking#TrackingNet#Precision
	Music Source Separation#MUSDB18#SDR (vocals)
	Text Summarization#Pubmed#ROUGE-L
	Link Prediction#Citeseer#AP
	Drug Discovery#QM9#Error ratio
	Text Summarization#Pubmed#ROUGE-1
	Text Summarization#Pubmed#ROUGE-2
	Visual Object Tracking#GOT-10k#Average Overlap
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Mean)
	Pedestrian Detection#CityPersons#Partial MR^-2
	Visual Object Tracking#TrackingNet#Accuracy
	Multi-Person Pose Estimation#COCO#AP
	Atari Games#Atari 2600 Asterix#Score
	Image Classification#CIFAR-100#PARAMS
	Few-Shot Image Classification#Mini-Imagenet 20-way (1-shot)#Accuracy
	Cross-Lingual NER#CoNLL German#F1
	RGB-D Salient Object Detection#STERE#S-Measure
	Image Super-Resolution#Manga109 - 3x upscaling#SSIM
	Temporal Action Localization#ActivityNet-1.3#mAP
	Link Prediction#FB15k-237#Hits@10
	3D Human Pose Estimation#HumanEva-I#Mean Reconstruction Error (mm)
	Atari Games#Atari 2600 Enduro#Score
	Photo geolocation estimation#Im2GPS#Country level (750 km)
	Scene Graph Generation#Visual Genome#Recall@50
	Panoptic Segmentation#Mapillary val#PQ
	3D Instance Segmentation#ScanNet(v2)#Mean AP @ 0.5
	Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CV II)
	Text Simplification#ASSET#BLEU
	Image Clustering#coil-100#NMI
	Skeleton Based Action Recognition#SBU#Accuracy
	Colorectal Gland Segmentation:#CRAG#Hausdorff Distance (mm)
	Image Super-Resolution#BSD100 - 2x upscaling#PSNR
	6D Pose Estimation using RGB#LineMOD#Accuracy
	Speech Recognition#Switchboard + Hub500#Percentage error
	Link Prediction#FB15k#MR
	Text Simplification#Newsela#BLEU
	Data-to-Text Generation#E2E NLG Challenge#ROUGE-L
	Named Entity Recognition#GENIA#F1
	Visual Question Answering#GQA Test2019#Distribution
	Image Classification#iNaturalist 2019#Top-1 Accuracy
	Image Classification#mini WebVision 1.0#ImageNet Top-5 Accuracy
	Head Pose Estimation#BIWI#MAE (trained with other data)
	Question Answering#TrecQA#MAP
	Visual Question Answering#VQA v1 test-std#Accuracy
	Sentiment Analysis#Yelp Fine-grained classification#Error
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#FED
	Image Super-Resolution#Manga109 - 8x upscaling#SSIM
	part-of-speech_tagging#VLSP 2013 POS tagging shared task#Accuracy
	Nested Named Entity Recognition#GENIA#F1
	Hate Speech Detection#Ethos Binary#Classification Accuracy
	Machine Translation#WMT2016 English-Romanian#BLEU score
	Text based Person Retrieval#CUHK-PEDES#R@10
	Visual Question Answering#GQA Test2019#Consistency
	Image Classification#ImageNet ReaL#Accuracy
	named_entity_recognition#VLSP 2016 NER shared task#F1
	Atari Games#Atari 2600 Phoenix#Score
	Natural Language Inference#SNLI#% Train Accuracy
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#FID
	Visual Question Answering#CLEVR-Humans#Accuracy
	Image Clustering#STL-10#Backbone
	Node Classification#PubMed (0.03%)#Accuracy
	Sentiment Analysis#Yelp Binary classification#Error
	Skeleton Based Action Recognition#NTU RGB+D 120#Accuracy (Cross-Subject)
	Word Sense Disambiguation#SensEval 3 Task 1#F1
	RGB-D Salient Object Detection#NLPR#Average MAE
	Dependency Parsing#Penn Treebank#POS
	Language Modelling#Penn Treebank (Character Level)#Bit per Character (BPC)
	Few-Shot Image Classification#Mini-Imagenet 5-way (10-shot)#Accuracy
	Graph Classification#NEURON-Average#Accuracy
	Node Classification#Cora (3%)#Accuracy
	sentiment_analysis#SUBJ#Accuracy
	amr_parsing#LDC2015E86#Smatch
	Part-Of-Speech Tagging#UD#Avg accuracy
	Atari Games#Atari 2600 Wizard of Wor#Score
	Pose Tracking#PoseTrack2017#MOTA
	3D Object Reconstruction#Data3D−R2N2#3DIoU
	Real-time Instance Segmentation#MSCOCO#AP75
	Visual Question Answering#MSVD-QA#Accuracy
	Few-Shot Image Classification#Meta-Dataset#Accuracy
	Sentiment Analysis#SST-5 Fine-grained classification#Accuracy
	Image Classification#WebVision-1000#ImageNet Top-5 Accuracy
	Atari Games#Atari 2600 Atlantis#Score
	Atari Games#Atari 2600 Road Runner#Score
	Image Super-Resolution#Urban100 - 2x upscaling#PSNR
	Semantic Segmentation#LIP val#mIoU
	Real-time Instance Segmentation#MSCOCO#AP50
	Speech Recognition#WSJ eval92#Word Error Rate (WER)
	Domain Adaptation#Office-Caltech#Average Accuracy
	Relation Extraction#DocRED#F1
	Node Classification#Wiki-Vote#Accuracy
	Semi-Supervised Video Object Segmentation#DAVIS 2016#J&F
	Language Modelling#Penn Treebank (Word Level)#Validation perplexity
	3D Point Cloud Classification#ModelNet40#Overall Accuracy
	Retinal Vessel Segmentation#DRIVE#AUC
	Face Alignment#300W#AUC0.08 private
	Few-Shot Image Classification#CIFAR-FS 5-way (5-shot)#Accuracy
	3D Object Detection#ScanNetV2#mAP@0.5
	Multivariate Time Series Forecasting#MuJoCo#MSE (10^-2, 50% missing)
	Link Prediction#YAGO3-10#Hits@10
	Graph Classification#RE-M5K#Accuracy
	Image Clustering#coil-100#Accuracy
	Text-to-Image Generation#Multi-Modal-CelebA-HQ#Acc
	Multiple Object Tracking#KITTI Tracking test#MOTA
	Document Classification#Cora#Accuracy
	Semantic Textual Similarity#SentEval#SICK-R
	Fake News Detection#FNC-1#Weighted Accuracy
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Mean)
	Semantic Textual Similarity#SentEval#SICK-E
	Self-Supervised Image Classification#ImageNet#Number of Params
	Object Detection#Waymo 2D detection all_ns f0val#COCO-style AP
	Few-Shot Image Classification#OMNIGLOT - 5-Shot, 20-way#Accuracy
	Question Answering#TrecQA#MRR
	Image Classification#mini WebVision 1.0#Top-1 Accuracy
	Neural Architecture Search#NAS-Bench-201, CIFAR-100#Accuracy (Val)
	Fine-Grained Image Classification#Stanford Cars#PARAMS
	Continuous Control#PyBullet Walker2D#Return
	Image-to-Image Translation#ADE20K Labels-to-Photos#FID
	Machine Translation#IWSLT2015 German-English#BLEU score
	Image Retrieval with Multi-Modal Query#Fashion200k#Recall@10
	Time Series Classification#Wafer#NLL
	Self-Supervised Image Classification#ImageNet#Top 5 Accuracy
	Dialogue Act Classification#Switchboard corpus#Accuracy
	Time Series Classification#CMUsubject16#Accuracy
	Atari Games#Atari 2600 Bowling#Score
	Sentiment Analysis#TweetEval#Hate
	language_modeling#WikiText-2#Number of params
	Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#MS-SSIM
	3D Multi-Object Tracking#KITTI#MOTA
	Graph Classification#COLLAB#Accuracy
	Gesture-to-Gesture Translation#NTU Hand Digit#IS
	3D Multi-Object Tracking#KITTI#MOTP
	Link Prediction#Cora#AUC
	Sentiment Analysis#Multi-Domain Sentiment Dataset#Kitchen
	Image Retrieval#Oxf5k#MAP
	Text Classification#Ohsumed#Accuracy
	RGB-D Salient Object Detection#NJU2K#S-Measure
	Retinal OCT Disease Classification#OCT2017#Sensitivity
	Data-to-Text Generation#WebNLG#BLEU
	Image Retrieval with Multi-Modal Query#Fashion200k#Recall@50
	3D Object Detection#SUN-RGBD val#mAP@0.25
	Machine Translation#WMT2014 English-German#SacreBLEU
	Fact-based Text Editing#WebEdit#F1
	Few-Shot Semantic Segmentation#PASCAL-5i (1-Shot)#Mean IoU
	Time Series Classification#JapaneseVowels#NLL
	Synthetic-to-Real Translation#Syn2Real-C#Accuracy
	Few-Shot Image Classification#Stanford Cars 5-way (1-shot)#Accuracy
	Image Classification#Stanford Cars#Accuracy
	3D Instance Segmentation#ScanNet(v2)#mAP
	Coreference Resolution#OntoNotes#F1
	Image Generation#CelebA-HQ 1024x1024#FID
	Node Classification#Pubmed#Validation
	Multivariate Time Series Forecasting#USHCN-Daily#MSE
	Human-Object Interaction Detection#HICO#mAP
	Panoptic Segmentation#COCO test-dev#PQst
	Image Classification#MNIST#Percentage error
	Code Generation#WikiSQL#Execution Accuracy
	Image Super-Resolution#Urban100 - 8x upscaling#SSIM
	Relation Extraction#DocRED#Ign F1
	Panoptic Segmentation#COCO test-dev#PQth
	Object Detection#Manga109-s 15test#COCO-style AP
	Instance Segmentation#Cityscapes test#Average Precision
	Action Classification#Charades#MAP
	Interactive Segmentation#GrabCut#NoC@85
	Action Classification#Kinetics-400#Flops x views
	Image Clustering#Imagenet-dog-15#Accuracy
	Real-Time Object Detection#COCO#FPS
	Recommendation Systems#MovieLens 1M#nDCG@10
	Speech Enhancement#DEMAND#CBAK
	word_sense_disambiguation#Senseval 3#F1
	Semi-Supervised Image Classification#ImageNet - 1% labeled data#Top 1 Accuracy
	Recommendation Systems#Million Song Dataset#Recall@50
	Named Entity Recognition#NCBI-disease#F1
	Trajectory Prediction#Stanford Drone#ADE-8/12 @K = 20
	Image Clustering#Fashion-MNIST#NMI
	Relation Extraction#TACRED#F1
	Fine-Grained Image Classification#Stanford Dogs#Accuracy
	Link Prediction#Yelp#HR@10
	Color Image Denoising#CBSD68 sigma50#PSNR
	Action Segmentation#50 Salads#F1@10%
	Cross-Lingual NER#CoNLL Spanish#F1
	Machine Translation#WMT2014 English-French#BLEU score
	3D Multi-Person Pose Estimation (absolute)#MuPoTS-3D#3DPCK
	Sentiment Analysis#TweetEval#Sentiment
	RGB-D Salient Object Detection#NJU2K#max F-Measure
	Atari Games#Atari 2600 Solaris#Score
	Depth Completion#KITTI Depth Completion#RMSE
	Entity Linking#WiC-TSV#Task 1 Accuracy: general purpose
	Action Segmentation#50 Salads#Edit
	Interactive Segmentation#GrabCut#NoC@90
	Visual Dialog#Visual Dialog v1.0 test-std#R@5
	Few-Shot Semantic Segmentation#PASCAL-5i (5-Shot)#Mean IoU
	Visual Dialog#Visual Dialog v1.0 test-std#R@1
	Keypoint Detection#COCO test-dev#ARM
	Keypoint Detection#COCO test-dev#ARL
	Link Prediction#MovieLens 25M#nDCG@10
	Image Super-Resolution#Set5 - 2x upscaling#PSNR
	Image Super-Resolution#Manga109 - 2x upscaling#PSNR
	Keypoint Detection#COCO test-dev#APM
	Question Answering#QASent#MAP
	Keypoint Detection#COCO test-dev#APL
	Unsupervised Domain Adaptation#Office-Home (RS-UT imbalance)#Average Per-Class Accuracy
	Visual Question Answering#COCO Visual Question Answering (VQA) real images 1.0 open ended#Percentage correct
	Hate Speech Detection#Ethos Binary#F1-score
	Action Segmentation#Breakfast#F1@25%
	relation_prediction#FB15K-237#H@10
	Adversarial Defense#ImageNet (non-targeted PGD, max perturbation=4)#Accuracy
	Action Segmentation#Breakfast#Edit
	Domain Adaptation#MNIST-to-USPS#Accuracy
	Language Modelling#WikiText-103#Test perplexity
	Time Series Classification#Wafer#Accuracy
	Link Prediction#WN18#Hits@3
	Link Prediction#WN18#Hits@1
	Spoken language identification#VoxForge European#Accuracy (%)
	Birds Eye View Object Detection#KITTI Cars Hard#AP
	Time Series Classification#ECG#Accuracy
	Video Semantic Segmentation#CamVid#Mean IoU
	Link Prediction#FB15k-237#MRR
	Video Super-Resolution#Vid4 - 4x upscaling#MOVIE
	Neural Architecture Search#CIFAR-10#Parameters
	Face Verification#Labeled Faces in the Wild#Accuracy
	Unsupervised Domain Adaptation#Duke to MSMT#mAP
	Few-Shot Image Classification#CUB 200 5-way 1-shot#Accuracy
	Scene Text Detection#MSRA-TD500#Recall
	Machine Translation#IWSLT2015 English-German#BLEU score
	Sentiment Analysis#TweetEval#Offensive
	Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Spanish#Accuracy
	Fact-based Text Editing#WebEdit#Recall
	Semantic Textual Similarity#STS Benchmark#Spearman Correlation
	Vision and Language Navigation#VLN Challenge#error
	Image Clustering#Extended Yale-B#Accuracy
	Object Detection#COCO test-dev#AP75
	Cross-Modal Retrieval#Flickr30k#Text-to-image R@10
	Interactive Segmentation#DAVIS#NoC@85
	Person Re-Identification#CUHK03#Rank-1
	Atari Games#Atari 2600 Gravitar#Score
	Interactive Segmentation#DAVIS#NoC@90
	Code Generation#WikiSQL#Exact Match Accuracy
	Few-Shot Image Classification#Mini-Imagenet 5-way (5-shot)#Accuracy
	Semi-Supervised Image Classification#cifar-100, 10000 Labels#Accuracy
	Object Detection#COCO minival#oLRP
	language_modeling#WikiText-103#Number of params
	Chinese Named Entity Recognition#Resume NER#F1
	Entity Disambiguation#AIDA-CoNLL#In-KB Accuracy
	Speech Enhancement#DEMAND#CSIG
	language_modeling#Penn Treebank#Number of params
	Image Generation#CIFAR-10#FID
	Object Detection#COCO test-dev#AP50
	Grayscale Image Denoising#Set12 sigma15#PSNR
	Semantic Role Labeling#CoNLL 2005#F1
	JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#SSIM
	Unsupervised Machine Translation#WMT2014 English-French#BLEU
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Recall)
	Question Generation#SQuAD1.1#BLEU-4
	Scene Text Detection#ICDAR 2015#Precision
	Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Russian#Accuracy
	3D Object Detection#KITTI Cars Easy val#AP
	3D Human Pose Estimation#3DPW#acceleration error
	Text Simplification#TurkCorpus#BLEU
	Semi-Supervised Image Classification#ImageNet - 10% labeled data#Top 5 Accuracy
	Unsupervised Image Classification#MNIST#Accuracy
	amr_parsing#LDC2014T12#F1 on Full
	dependency_parsing#benchmark Vietnamese dependency treebank VnDT#UAS
	Atari Games#Atari 2600 Video Pinball#Score
	Image Classification#EMNIST-Balanced#Accuracy
	Person Re-Identification#MARS#Rank-5
	Image Clustering#MNIST-test#NMI
	Semantic Similarity#SICK#Spearman Correlation
	Person Re-Identification#MARS#Rank-1
	Link Prediction#Yelp#nDCG@10
	Neural Architecture Search#CIFAR-100#FLOPS
	Question Answering#Quora Question Pairs#Accuracy
	Word Sense Disambiguation#SemEval 2015 Task 13#F1
	Speech Synthesis#North American English#Mean Opinion Score
	Fine-Grained Image Classification#NABirds#Accuracy
	Music Transcription#MusicNet#Number of params
	Link Prediction#FB15k#MRR
	Image Retrieval#Flickr30K 1K test#R@10
	Mortality Prediction#MIMIC-III#Recall
	Text Simplification#PWKP / WikiSmall#BLEU
	Neural Architecture Search#CIFAR-100#PARAMS
	Semantic Role Labeling (predicted predicates)#CoNLL 2012#F1
	Fact-based Text Editing#WebEdit#DELETE
	Grammatical Error Correction#CoNLL-2014 Shared Task#F0.5
	Scene Text Detection#ICDAR 2015#Recall
	3D Object Detection#KITTI Cars Hard#AP
	Neural Architecture Search#CIFAR-100#Percentage Error
	Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-French#Accuracy
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Decay)
	Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Laptop#F1
	Node Classification#CiteSeer with Public Split: fixed 20 nodes per class#Accuracy
	Temporal Action Localization#THUMOS’14#mAP IOU@0.2
	Temporal Action Localization#THUMOS’14#mAP IOU@0.3
	Subjectivity Analysis#SUBJ#Accuracy
	Temporal Action Localization#THUMOS’14#mAP IOU@0.1
	Temporal Action Localization#THUMOS’14#mAP IOU@0.6
	Temporal Action Localization#THUMOS’14#mAP IOU@0.7
	Temporal Action Localization#THUMOS’14#mAP IOU@0.4
	Real-time Instance Segmentation#MSCOCO#APL
	Temporal Action Localization#THUMOS’14#mAP IOU@0.5
	Real-time Instance Segmentation#MSCOCO#APM
	Question Answering#bAbi#Accuracy (trained on 10k)
	Real-time Instance Segmentation#MSCOCO#APS
	Speech Recognition#TIMIT#Percentage error
	Visual Dialog#Visual Dialog v1.0 test-std#Mean
	Graph Classification#NEURON-BINARY#Accuracy
	Language Modelling#Penn Treebank (Word Level)#Test perplexity
	Unsupervised Machine Translation#WMT2014 French-English#BLEU
	Video Retrieval#MSVD#text-to-video R@5
	RGB-D Salient Object Detection#NJU2K#Average MAE
	Video Retrieval#MSVD#text-to-video R@1
	text_classification#AG News#Error
	Pose Estimation#MPII Human Pose#PCKh-0.5
	Scene Text Detection#MSRA-TD500#Precision
	3D Human Pose Estimation#3DPW#PA-MPJPE
	Image Clustering#ImageNet-10#NMI
	Face Alignment#WFLW#FR@0.1(%, all)
	Image-to-Image Translation#COCO-Stuff Labels-to-Photos#FID
	relationship_extraction#New York Times Corpus#P@30%
	Fine-Grained Image Classification#Caltech-101#Top-1 Error Rate
	Human-Object Interaction Detection#V-COCO#MAP
	Conversational Response Selection#PolyAI Reddit#1-of-100 Accuracy
	Semi-Supervised Semantic Segmentation#Cityscapes 12.5% labeled#Validation mIoU
	Fact-based Text Editing#WebEdit#BLEU
	Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Accuracy (Test)
	Object Counting#Pascal VOC 2007 count-test#mRMSE-nz
	Sentiment Analysis#IMDb#Accuracy
	Image Generation#Binarized MNIST#nats
	3D Object Detection#ScanNetV2#mAP@0.25
	Lane Detection#CULane#F1 score
	Unsupervised Domain Adaptation#Duke to MSMT#rank-10
	Image Clustering#Imagenet-dog-15#NMI
	Image Super-Resolution#Set14 - 3x upscaling#PSNR
	Dialogue State Tracking#Wizard-of-Oz#Request
	Pedestrian Detection#Caltech#Reasonable Miss Rate
	Instance Segmentation#COCO minival#mask AP
	Relation Extraction#ADE Corpus#RE+ Macro F1
	Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Decay)
	Semi-Supervised Image Classification#SVHN, 1000 labels#Accuracy
	Time Series Classification#KickvsPunch#NLL
	Person Re-Identification#CUHK03 labeled#Rank-1
	Semi-Supervised Video Object Segmentation#YouTube-VOS#F-Measure (Unseen)
	JPEG Artifact Correction#LIVE1 (Quality 10 Color)#SSIM
	Atari Games#Atari 2600 Tennis#Score
	3D Object Reconstruction#Data3D−R2N2#Avg F1
	Question Answering#QASent#MRR
	Traffic Prediction#PeMS-M#MAE (60 min)
	Constituency Grammar Induction#PTB#Max F1 (WSJ)
	Conditional Image Generation#CIFAR-10#FID
	Visual Question Answering#VQA v2 test-std#yes/no
	Image Classification#Flowers-102#Accuracy
	Image Super-Resolution#Set5 - 4x upscaling#SSIM
	Recommendation Systems#MovieLens 1M#RMSE
	Action Segmentation#Breakfast#F1@10%
	Graph Classification#ENZYMES#Accuracy
	Unsupervised Facial Landmark Detection#MAFL#NME
	Keypoint Detection#COCO test-dev#AR50
	Depth Completion#KITTI Depth Completion#Runtime [ms]
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#PSNR
	Image Super-Resolution#Urban100 - 4x upscaling#SSIM
	Constituency Parsing#Penn Treebank#F1 score
	Person Re-Identification#CUHK03 labeled#MAP
	Keypoint Detection#COCO test-dev#AR75
	Panoptic Segmentation#Cityscapes val#mIoU
	Relation Extraction#ADE Corpus#NER Macro F1
	Semi-Supervised Video Object Segmentation#YouTube#mIoU
	Object Detection#UAVDT#mAP
	Keypoint Detection#COCO test-challenge#ARL
	Keypoint Detection#COCO test-challenge#ARM
	Question Answering#WikiQA#MRR
	Image Generation#Cityscapes#FID-10k-training-steps
	Real-time Instance Segmentation#MSCOCO#Frame (fps)
	Few-Shot Image Classification#FC100 5-way (5-shot)#Accuracy
	word_segmentation#Chinese Treebank 6#F1
	summarization#CNN / Daily Mail (Anonymized version)#ROUGE-2
	summarization#CNN / Daily Mail (Anonymized version)#ROUGE-1
	Cross-Lingual NER#CoNLL Dutch#F1
	Natural Language Inference#FarsTail#% Test Accuracy
	Scene Text Detection#Total-Text#Precision
	Link Prediction#YAGO3-10#Hits@3
	Link Prediction#YAGO3-10#Hits@1
	Word Sense Disambiguation#SemEval 2007 Task 17#F1
	Neural Architecture Search#CIFAR-10#Search Time (GPU days)
	3D Object Detection#KITTI Pedestrians Hard#AP
	word_segmentation#VLSP 2013 word segmentation shared task#F1
	Image Clustering#Tiny-ImageNet#Accuracy
	summarization#CNN / Daily Mail (Anonymized version)#ROUGE-L
	Visual Question Answering#VQA-CP#Score
	Node Classification#USA Air-Traffic#Accuracy
	Image Clustering#CIFAR-10#ARI
	Image/Document Clustering#pendigits#runtime (s)
	Action Segmentation#GTEA#Edit
	Weakly Supervised Action Localization#ActivityNet-1.3#mAP@0.5
	Panoptic Segmentation#Cityscapes test#PQ
	taxonomy_learning#SemEval 2018#MAP
	AMR Parsing#LDC2014T12#F1 Full
	sentiment_analysis#SemEval-2014 Task 4 subtask 2 Aspect Term Polarity#Laptop (acc)
	Keypoint Detection#COCO test-challenge#APL
	Fundus to Angiography Generation#Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients#Kernel Inception Distance
	Hate Speech Detection#HateXplain#Accuracy
	Image Denoising#SIDD#SSIM (sRGB)
	Document Summarization#CNN / Daily Mail#ROUGE-1
	Document Summarization#CNN / Daily Mail#ROUGE-2
	Few-Shot Object Detection#MS-COCO (10-shot)#AP
	Time Series Classification#PenDigits#NLL
	word_segmentation#MSR#F1
	3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
	Semantic Segmentation#SkyScapes-Dense#Mean IoU
	Object Counting#COCO count-test#m-reIRMSE
	Visual Question Answering#GQA Test2019#Accuracy
	Speech Enhancement#DEMAND#PESQ
	Node Classification#Cornell#Accuracy
	Document Summarization#CNN / Daily Mail#ROUGE-L
	Grammatical Error Correction#BEA-2019 (test)#F0.5
	Visual Question Answering#GQA test-std#Accuracy
	Click-Through Rate Prediction#Amazon#AUC
	Multimodal Machine Translation#Multi30K#BLEU (EN-DE)
	Skeleton Based Action Recognition#NTU RGB+D 120#Accuracy (Cross-Setup)
	Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.3-0.7)
	Open-Domain Question Answering#SearchQA#N-gram F1
	Keypoint Detection#COCO test-challenge#AR50
	RGB-D Salient Object Detection#NJU2K#max E-Measure
	Domain Adaptation#SYNSIG-to-GTSRB#Accuracy
	Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#PSNR
	Keypoint Detection#COCO test-challenge#AR75
	Retinal Vessel Segmentation#STARE#AUC
	Stochastic Optimization#CIFAR-100 WRN-28-10 - 200 Epochs#Accuracy
	Spoken language identification#LRE07#3 sec
	3D Semantic Segmentation#SemanticKITTI#mIoU
	Text Summarization#arXiv#ROUGE-1
	Text Summarization#arXiv#ROUGE-2
	Image Matting#Composition-1K#SAD
	Vision and Language Navigation#VLN Challenge#length
	Object Counting#COCO count-test#mRMSE
	Scene Text Recognition#SVT#Accuracy
	Atari Games#Atari 2600 Demon Attack#Score
	Lipreading#Lip Reading in the Wild#Top-1 Accuracy
	Image Classification#Flowers-102#PARAMS
	Time Series Classification#CharacterTrajectories#NLL
	Text Summarization#arXiv#ROUGE-L
	question_answering#CNN / Daily Mail#Accuracy on Daily Mail
	Instance Segmentation#iSAID#Average Precision
	Single Image Deraining#Test1200#PSNR
	Visual Question Answering#VQA v1 test-dev#Accuracy
	Word Sense Disambiguation#SemEval 2007 Task 7#F1
	Multimodal Activity Recognition#EV-Action#Accuracy
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Decay)
	Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#MS-SSIM
	Entity Linking#WiC-TSV#Task 3 Accuracy: domain specific
	relationship_extraction#SemEval-2010 Task 8#F1
	Recommendation Systems#MovieLens 1M#HR@10
	Named Entity Recognition#ACE 2004#F1
	Node Classification#Facebook#Accuracy
	Action Detection#Charades#mAP
	Atari Games#Atari 2600 Amidar#Score
	Image Classification#WebVision-1000#ImageNet Top-1 Accuracy
	Scene Text Detection#ICDAR 2017 MLT#Precision
	Fact-based Text Editing#WebEdit#KEEP
	Visual Object Tracking#LaSOT#AUC
	Image Classification#iNaturalist#Top 1 Accuracy
	Graph Classification#UPFD-POL#Accuracy (%)
	Skeleton Based Action Recognition#N-UCLA#Accuracy
	Scene Text Detection#ICDAR 2017 MLT#Recall
	Conditional Image Generation#ImageNet 128x128#FID
	language_modeling#1B Words / Google Billion Word benchmark#Test perplexity
	6D Pose Estimation#YCB-Video#ADDS AUC
	Semi-Supervised Image Classification#CIFAR-10, 250 Labels#Accuracy
	Semi-Supervised Video Object Segmentation#YouTube-VOS#F-Measure (Seen)
	Image Super-Resolution#Manga109 - 4x upscaling#SSIM
	Panoptic Segmentation#COCO panoptic#PQst
	machine_translation#WMT 2014 EN-FR#BLEU
	Entity Linking#WiC-TSV#Task 3 Accuracy: all
	Pose Estimation#COCO test-dev#AP50
	Few-Shot Image Classification#Stanford Dogs 5-way (5-shot)#Accuracy
	Panoptic Segmentation#COCO panoptic#PQth
	Atari Games#Atari 2600 Chopper Command#Score
	Time Series Classification#PEMS#NLL
	Question Answering#SQuAD2.0 dev#F1
	Question Answering#SQuAD2.0 dev#EM
	Natural Language Inference#MultiNLI#Matched
	Dense Pixel Correspondence Estimation#HPatches#Viewpoint V AEPE
	Unsupervised Domain Adaptation#Market to Duke#mAP
	Time Series Classification#NetFlow#NLL
	Node Classification#PPI#F1
	Temporal Action Proposal Generation#ActivityNet-1.3#AR@100
	Sequential Image Classification#Sequential MNIST#Permuted Accuracy
	Click-Through Rate Prediction#Bing News#Log Loss
	Neural Architecture Search#CIFAR-10 Image Classification#Percentage error
	JPEG Artifact Correction#ICB (Quality 20 Color)#PSNR
	Data-to-Text Generation#WebNLG Full#BLEU
	Pose Estimation#Leeds Sports Poses#PCK
	Person Re-Identification#Market-1501#Rank-5
	Semantic Segmentation#COCO-Stuff test#mIoU
	Person Re-Identification#Market-1501#Rank-1
	JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#PSNR
	Conditional Image Generation#CIFAR-10#Inception score
	Pose Estimation#COCO test-dev#AP75
	Image Generation#CelebA 256x256#bpd
	Object Detection#KITTI Cars Easy#AP
	Reading Comprehension#RACE#Accuracy (Middle)
	Unsupervised Domain Adaptation#Cityscapes to Foggy Cityscapes#mAP@0.5
	Real-Time Semantic Segmentation#Cityscapes test#Time (ms)
	Ad-Hoc Information Retrieval#TREC Robust04#MAP
	Image Clustering#CIFAR-100#Accuracy
	Image Clustering#USPS#Accuracy
	Question Answering#CNN / Daily Mail#CNN
	Image Retrieval#CARS196#R@1
	Image Super-Resolution#Set5 - 8x upscaling#SSIM
	Fine-Grained Image Classification#Oxford-IIIT Pets#Top-1 Error Rate
	Neural Architecture Search#CIFAR-10#Top-1 Error Rate
	Image Clustering#USPS#NMI
	Real-Time Semantic Segmentation#NYU Depth v2#mIoU
	Node Classification#Citeseer Full-supervised#Accuracy
	Atari Games#Atari 2600 Battle Zone#Score
	Graph Regression#Lipophilicity#RMSE
	Video Instance Segmentation#YouTube-VIS validation#AP75
	Image Classification#ImageNet V2#Top 1 Accuracy
	Action Segmentation#Breakfast#Acc
	Scene Text Recognition#ICDAR2013#Accuracy
	Few-Shot Image Classification#Tiered ImageNet 10-way (1-shot)#Accuracy
	Semantic Segmentation#S3DIS Area5#mAcc
	Cross-Modal Retrieval#COCO 2014#Image-to-text R@10
	Object Counting#Pascal VOC 2007 count-test#m-relRMSE
	Link Prediction#FB15k-237#MR
	Spoken language identification#LRE07#10 sec
	Video Instance Segmentation#YouTube-VIS validation#AP50
	Text Classification#R8#Accuracy
	Node Classification#Wikipedia#Macro-F1
	Atari Games#Atari 2600 Alien#Score
	Atari Games#Atari 2600 Q*Bert#Score
	Single Image Deraining#Rain100L#PSNR
	Image Super-Resolution#Set14 - 8x upscaling#PSNR
	Question Answering#NarrativeQA#METEOR
	Single Image Deraining#Test2800#PSNR
	3D Object Detection#nuScenes#mAP
	Optical Flow Estimation#Sintel-clean#Average End-Point Error
	Image Classification#Oxford-IIIT Pets#Accuracy
	Object Detection#KITTI Cars Moderate#AP
	Grayscale Image Denoising#Urban100 sigma50#PSNR
	Atari Games#Atari 2600 Defender#Score
	Zero-Shot Learning#SUN Attribute#average top-1 classification accuracy
	Semantic Textual Similarity#SentEval#MRPC
	Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: domain specific
	Few-Shot Object Detection#MS-COCO (30-shot)#AP
	relationship_extraction#New York Times Corpus#P@10%
	Few-Shot Image Classification#Mini-Imagenet 5-way (1-shot)#Accuracy
	3D Human Pose Estimation#MPI-INF-3DHP#MJPE
	Graph Classification#HIV-fMRI-77#F1
	Sentiment Analysis#TweetEval#ALL
	Single Image Deraining#Rain100H#SSIM
	Medical Image Segmentation#CVC-ClinicDB#mean Dice
	Video Generation#UCF-101 16 frames, 64x64, Unconditional#Inception Score
	question_answering#Quasar#EM (Quasar-T)
	Person Re-Identification#Market-1501#Rank-10
	Question Answering#CNN / Daily Mail#Daily Mail
	Video Object Detection#ImageNet VID#MAP
	Weakly Supervised Action Localization#THUMOS 2014#mAP@0.5
	Humor Detection#200k Short Texts for Humor Detection#F1-score
	Node Classification#Flickr#Accuracy
	Multi-Object Tracking#MOT17#MOTA
	Sentiment Analysis#Amazon Review Full#Accuracy
	Language Modelling#Hutter Prize#Bit per Character (BPC)
	Semantic Segmentation#ScanNet#3DIoU
	Semantic Segmentation#ADE20K#Test Score
	Crowd Counting#UCF-QNRF#MAE
	word_sense_disambiguation#SemEval 2007#F1
	Question Answering#WikiQA#MAP
	Image-to-Image Translation#COCO-Stuff Labels-to-Photos#mIoU
	Keypoint Detection#COCO test-dev#AP50
	Semantic Segmentation#Nighttime Driving#mIoU
	Semantic Textual Similarity#SICK#Spearman Correlation
	Text-to-Image Generation#CUB#Inception score
	Visual Dialog#Visual Dialog v1.0 test-std#R@10
	Mortality Prediction#MIMIC-III#Precision
	Keypoint Detection#COCO test-dev#AP75
	Dependency Parsing#Penn Treebank#UAS
	Graph Classification#NCI109#Accuracy
	Text Summarization#X-Sum#ROUGE-3
	Text Summarization#X-Sum#ROUGE-2
	Text Summarization#X-Sum#ROUGE-1
	Unsupervised Domain Adaptation#Duke to MSMT#rank-1
	Person Search#CUHK-SYSU#MAP
	Unsupervised Domain Adaptation#Duke to MSMT#rank-5
	Semantic Role Labeling#OntoNotes#F1
	Semantic Similarity#SICK#Pearson Correlation
	Video Retrieval#LSMDC#text-to-video R@10
	Image Classification#VTAB-1k#Top-1 Accuracy
	Anomaly Detection#Unlabeled CIFAR-10 vs CIFAR-100#AUROC
	Line Segment Detection#wireframe dataset#sAP5
	Domain Adaptation#SVNH-to-MNIST#Accuracy
	3D Point Cloud Classification#ScanObjectNN#Overall Accuracy
	Vehicle Pose Estimation#KITTI Cars Hard#Average Orientation Similarity
	Weakly Supervised Object Detection#PASCAL VOC 2012 test#MAP
	Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Laptop (Acc)
	Few-Shot Image Classification#OMNIGLOT - 1-Shot, 5-way#Accuracy
	Language Modelling#WikiText-2#Test perplexity
	Graph Classification#IMDb-B#Accuracy
	sentiment_analysis#SST-2#Accuracy
	Multi-tissue Nucleus Segmentation#Kumar#Hausdorff Distance (mm)
	Hate Speech Detection#Ethos Binary#Precision
	Time Series Classification#AUSLAN#Accuracy
	Click-Through Rate Prediction#Dianping#AUC
	Face Verification#Trillion Pairs Dataset#Accuracy
	Sentiment Analysis#TweetEval#Irony
	dependency_parsing#Penn Treebank#LAS
	Sentiment Analysis#MR#Accuracy
	Video Generation#UCF-101 16 frames, Unconditional, Single GPU#Inception Score
	Unsupervised Machine Translation#WMT2016 English-German#BLEU
	Node Classification#Wisconsin#Accuracy
	Cross-Modal Retrieval#COCO 2014#Text-to-image R@5
	Cross-Modal Retrieval#COCO 2014#Text-to-image R@1
	Video Instance Segmentation#YouTube-VIS validation#AR1
	Question Answering#NewsQA#F1
	Visual Object Tracking#VOT2017#Expected Average Overlap (EAO)
	Node Classification#Wikipedia#Accuracy
	Action Classification#Kinetics-700#Top-1 Accuracy
	Atari Games#Atari 2600 Kung-Fu Master#Score
	Image Classification#CIFAR-100#Percentage correct
	Machine Translation#WMT2014 German-English#BLEU score
	Object Counting#Pascal VOC 2007 count-test#m-reIRMSE-nz
	Trajectory Prediction#Stanford Drone#FDE-8/12 @K= 20
	Zero-Shot Learning#CUB-200-2011#average top-1 classification accuracy
	Word Sense Disambiguation#Supervised:#SemEval 2015
	Named Entity Recognition#BC5CDR#F1
	Word Sense Disambiguation#Supervised:#SemEval 2013
	Word Sense Disambiguation#Supervised:#SemEval 2007
	Language Modelling#WikiText-2#Number of params
	Line Segment Detection#wireframe dataset#sAP15
	Line Segment Detection#wireframe dataset#sAP10
	Node Classification#Pubmed#Accuracy
	Neural Architecture Search#CIFAR-10 Image Classification#FLOPS
	Visual Object Tracking#GOT-10k#Success Rate 0.5
	Retinal OCT Disease Classification#OCT2017#Acc
	Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Dice
	Lane Detection#TuSimple#Accuracy
	summarization#CNN / Daily Mail (Non-anonymized version)#METEOR
	Image Clustering#CIFAR-10#Backbone
	Neural Architecture Search#NAS-Bench-201, CIFAR-10#Accuracy (Test)
	6D Pose Estimation using RGBD#LineMOD#Mean ADD
	text_classification#DBpedia#Error
	Person Re-Identification#MARS#mAP
	Visual Question Answering#COCO Visual Question Answering (VQA) real images 1.0 multiple choice#Percentage correct
	Time Series Classification#KickvsPunch#Accuracy
	Hyperspectral Image Classification#Pavia University#Overall Accuracy
	Text Simplification#TurkCorpus#SARI (EASSE>=0.2.1)
	Graph Clustering#Cora#Accuracy
	Vision and Language Navigation#VLN Challenge#spl
	Crowd Counting#UCF CC 50#MAE
	Keypoint Detection#COCO test-challenge#AP50
	Video Retrieval#LSMDC#text-to-video Median Rank
	Sentiment Analysis#TweetEval#Stance
	chunking#Penn Treebank#F1
	Keypoint Detection#COCO test-challenge#AP75
	Relation Extraction#ACE 2004#NER Micro F1
	Semi-Supervised Image Classification#ImageNet - 10% labeled data#Top 1 Accuracy
	Atari Games#Atari 2600 HERO#Score
	Multi-tissue Nucleus Segmentation#Kumar#Dice
	Link Prediction#WN18#Hits@10
	Semantic Segmentation#S3DIS#mAcc
	Image Super-Resolution#BSD100 - 4x upscaling#SSIM
	Image Classification#mini WebVision 1.0#ImageNet Top-1 Accuracy
	Anomaly Detection#One-class ImageNet-30#AUROC
	Few-Shot Image Classification#Tiered ImageNet 5-way (1-shot)#Accuracy
	Neural Architecture Search#ImageNet#Params
	Multimodal Activity Recognition#Moments in Time Dataset#Top-5 (%)
	question_answering#SearchQA#EM
	question_answering#SearchQA#F1
	Image-to-Image Translation#Cityscapes Labels-to-Photo#Per-pixel Accuracy
	Real-Time Semantic Segmentation#CamVid#Frame (fps)
	Image Generation#CIFAR-10#Inception score
	Click-Through Rate Prediction#MovieLens 20M#AUC
	summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-L
	Action Recognition#NTU RGB+D#Accuracy (CV)
	Cross-Modal Retrieval#Flickr30k#Image-to-text R@5
	Cross-Modal Retrieval#Flickr30k#Image-to-text R@1
	Semantic Segmentation#ADE20K val#mIoU
	Multi-Label Classification#PASCAL VOC 2007#mAP
	Ad-Hoc Information Retrieval#TREC Robust04#nDCG@20
	Scene Text Detection#Total-Text#Recall
	Unsupervised Person Re-Identification#DukeMTMC-reID#Rank-1
	Birds Eye View Object Detection#KITTI Cars Easy#AP
	Emotion Recognition in Conversation#MELD#Weighted Macro-F1
	Graph Classification#UPFD-GOS#Accuracy (%)
	Named Entity Recognition#CoNLL 2003 (German)#F1
	Person Re-Identification#MSMT17#mAP
	Image Matting#Composition-1K#Grad
	Birds Eye View Object Detection#KITTI Pedestrians Moderate#AP
	Atari Games#Atari 2600 Space Invaders#Score
	Real-Time Object Detection#PASCAL VOC 2007#MAP
	Graph Regression#ZINC#MAE
	Sentiment Analysis#Multi-Domain Sentiment Dataset#Electronics
	Action Recognition#NTU RGB+D#Accuracy (CS)
	Semantic Textual Similarity#SentEval#STS
	Neural Architecture Search#NAS-Bench-201, CIFAR-100#Search time (s)
	Node Classification#MAG240M-LSC#Test Accuracy
	summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-1
	summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-2
	Retinal OCT Disease Classification#Srinivasan2014#Acc
	Skeleton Based Action Recognition#SYSU 3D#Accuracy
	Video Frame Interpolation#Middlebury#Interpolation Error
	Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: all
	Grammatical Error Correction#JFLEG#GLEU
	Grayscale Image Denoising#BSD68 sigma50#PSNR
	Facial Expression Recognition#AffectNet#Accuracy (8 emotion)
	Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-L
	Link Prediction#WN18RR#MRR
	Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-2
	Linguistic Acceptability#CoLA#Accuracy
	Sentiment Analysis#Multi-Domain Sentiment Dataset#Average
	Graph Classification#HIV-fMRI-77#Accuracy
	Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-1
	Monocular Depth Estimation#NYU-Depth V2#RMSE
	Colorectal Gland Segmentation:#CRAG#F1-score
	Video Retrieval#MSVD#text-to-video R@10
	Fact-based Text Editing#WebEdit#Precision
	Speech Recognition#MediaSpeech#WER for Spanish
	Metric Learning#CARS196#R@1
	Action Classification#Moments in Time#Top 1 Accuracy
	Node Classification#Cora (0.5%)#Accuracy
	Question Answering#SQuAD1.1 dev#F1
	Question Answering#SQuAD1.1 dev#EM
	Video Instance Segmentation#YouTube-VIS validation#AR10
	Few-Shot Image Classification#Tiered ImageNet 10-way (5-shot)#Accuracy
	Few-Shot Image Classification#Mini-ImageNet-CUB 5-way (1-shot)#Accuracy
	Weakly Supervised Object Detection#PASCAL VOC 2007#MAP
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Recall)
	Image Retrieval#Par106k#mAP
	Fake News Detection#FNC-1#Per-class Accuracy (Agree)
	Fundus to Angiography Generation#Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients#FID
	Atari Games#Atari 2600 Centipede#Score
	Image Generation#STL-10#FID
	Image Clustering#CIFAR-100#Train Set
	Weakly Supervised Object Detection#Charades#MAP
	part-of-speech_tagging#Penn Treebank#Accuracy
	word_sense_disambiguation#SemEval 2013#F1
	Unsupervised Domain Adaptation#Duke to Market#mAP
	Video Super-Resolution#Vid4 - 4x upscaling#SSIM
	Speech Enhancement#Deep Noise Suppression (DNS) Challenge#PESQ-NB
	JPEG Artifact Correction#ICB (Quality 10 Color)#SSIM
	Few-Shot Image Classification#Mini-Imagenet 10-way (5-shot)#Accuracy
	Multi-Person Pose Estimation#COCO test-dev#AP75
	Image Denoising#SIDD#PSNR (sRGB)
	RGB-D Salient Object Detection#NLPR#max F-Measure
	Action Recognition#EPIC-KITCHENS-100#Noun@1
	Node Classification#BlogCatalog#Accuracy
	Speech Enhancement#DEMAND#COVL
	Named Entity Recognition#CoNLL 2002 (Spanish)#F1
	Multi-Person Pose Estimation#COCO test-dev#AP50
	Time Series Classification#ArabicDigits#NLL
	Referring Expression Segmentation#RefCOCO testA#IoU
	Joint Entity and Relation Extraction#SciERC#Relation F1
	Action Segmentation#Breakfast#F1@50%
	Face Identification#Trillion Pairs Dataset#Accuracy
	Neural Architecture Search#ImageNet#MACs
	Sentiment Analysis#SST-2 Binary classification#Accuracy
	Monocular 3D Human Pose Estimation#Human3.6M#Use Video Sequence
	Relation Extraction#ChemProt#F1
	Atari Games#Atari 2600 Double Dunk#Score
	Node Classification#Citeseer#Validation
	Semi-Supervised Image Classification#SVHN, 250 Labels#Accuracy
	RGB-D Salient Object Detection#SIP#S-Measure
	Data-to-Text Generation#MULTIWOZ 2.1#BLEU
	Image Super-Resolution#Set14 - 2x upscaling#PSNR
	Self-Supervised Action Recognition#HMDB51#Pre-Training Dataset
	Video Retrieval#MSR-VTT-1kA#text-to-video R@5
	Video Retrieval#MSR-VTT-1kA#text-to-video R@1
	Instance Segmentation#COCO minival#AP50
	Object Detection#COCO test-dev#APS
	RGB-D Salient Object Detection#STERE#Average MAE
	Scene Text Recognition#ICDAR 2003#Accuracy
	Click-Through Rate Prediction#Criteo#AUC
	Node Classification#Citeseer#Accuracy
	JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#PSNR-B
	Speech Enhancement#Deep Noise Suppression (DNS) Challenge#PESQ-WB
	Recommendation Systems#MovieLens 20M#Recall@20
	Instance Segmentation#COCO minival#AP75
	Sentiment Analysis#SemEval 2014 Task 4 Subtask 1+2#F1
	Image Classification#mini WebVision 1.0#Top-5 Accuracy
	Abstractive Text Summarization#CNN / Daily Mail#ROUGE-L
	Neural Architecture Search#NAS-Bench-201, CIFAR-10#Accuracy (val)
	Abstractive Text Summarization#CNN / Daily Mail#ROUGE-1
	Abstractive Text Summarization#CNN / Daily Mail#ROUGE-2
	Audio Classification#ESC-50#Top-1 Accuracy
	Object Detection#COCO test-dev#APM
	Object Detection#COCO test-dev#APL
	Retinal Vessel Segmentation#DRIVE#F1 score
	Music Modeling#Nottingham#NLL
	Fine-Grained Image Classification#Food-101#Accuracy
	Common Sense Reasoning#Winograd Schema Challenge#Score
	language_modeling#Hutter Prize#Number of params
	Quantization#ImageNet#Accuracy (%)
	Language Modelling#Penn Treebank (Character Level)#Number of params
	Music Source Separation#MUSDB18#SDR (drums)
	Machine Translation#WMT2016 English-German#BLEU score
	Link Prediction#OpenBioLink#Hits@10
	Image Generation#ImageNet 64x64#Bits per dim
	Few-Shot Image Classification#Mini-ImageNet-CUB 5-way (5-shot)#Accuracy
	Fine-Grained Image Classification#Oxford-IIIT Pets#PARAMS
	Grammatical Error Detection#CoNLL-2014 A1#F0.5
	Object Counting#COCO count-test#m-reIRMSE-nz
	Image Clustering#MNIST-full#Accuracy
	Visual Object Tracking#OTB-2013#AUC
	Bias Detection#StereoSet#ICAT Score
	Line Segment Detection#wireframe dataset#F1 score
	Image-to-Image Translation#ADE20K-Outdoor Labels-to-Photos#FID
	Single Image Deraining#Test100#PSNR
	Visual Dialog#Visual Dialog v1.0 test-std#NDCG (x 100)
	JPEG Artifact Correction#LIVE1 (Quality 20 Color)#PSNR
	Birds Eye View Object Detection#KITTI Cars Moderate#AP
	Language Modelling#WikiText-2#Validation perplexity
	Machine Translation#IWSLT2014 German-English#BLEU score
	Graph Classification#REDDIT-B#Accuracy
	Recommendation Systems#Netflix#nDCG@100
	Image Classification#ImageNet#Top 1 Accuracy
	Natural Language Inference#SciTail#Accuracy
	Weakly Supervised Action Localization#THUMOS 2014#mAP@0.1:0.7
	Weakly Supervised Action Localization#THUMOS 2014#mAP@0.1:0.5
	Scene Text Recognition#ICDAR2015#Accuracy
	Image Super-Resolution#Set5 - 3x upscaling#SSIM
	Crowd Counting#ShanghaiTech A#MAE
	Semi-Supervised Video Object Segmentation#YouTube-VOS#Overall
	Recommendation Systems#Douban Monti#RMSE
	Open-Domain Question Answering#Quasar#F1 (Quasar-T)
	Instance Segmentation#COCO minival#APL
	Instance Segmentation#COCO minival#APM
	Instance Segmentation#COCO minival#APS
	Semi-Supervised Video Object Segmentation#YouTube-VOS#Jaccard (Seen)
	Object Detection#KITTI Cars Hard#AP
	Task-Oriented Dialogue Systems#KVRET#Entity F1
	3D Object Detection#KITTI Pedestrians Moderate#AP
	Multi-Person Pose Estimation#CrowdPose#mAP @0.5:0.95
	Motion Segmentation#Apolloscape#Accuracy
	Semantic Segmentation#ADE20K#Validation mIoU
	Action Recognition#EPIC-KITCHENS-100#Verb@1
	Action Recognition#THUMOS’14#mAP@0.3
	Action Recognition#THUMOS’14#mAP@0.4
	Action Recognition#THUMOS’14#mAP@0.5
	named_entity_recognition#Ontonotes v5 (English)#F1
	Action Recognition#THUMOS’14#mAP@0.1
	Action Recognition#THUMOS’14#mAP@0.2
	Action Segmentation#GTEA#F1@10%
	language_modeling#WikiText-103#Test perplexity
	Image-to-Image Translation#GTAV-to-Cityscapes Labels#mIoU
	Continual Learning#visual domain decathlon (10 tasks)#decathlon discipline (Score)
	Aspect Sentiment Triplet Extraction#SemEval#F1
	Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#SSIM
	Video Generation#BAIR Robot Pushing#FVD score
	Relation Extraction#ACE 2004#RE+ Micro F1
	Multi-Person Pose Estimation#COCO test-dev#AP
	Monocular Depth Estimation#KITTI Eigen split#absolute relative error
	Atari Games#Atari 2600 Tutankham#Score
	RGB-D Salient Object Detection#LFSD#Average MAE
	Unsupervised Domain Adaptation#Duke to Market#rank-10
	Dense Video Captioning#ActivityNet Captions#METEOR
	Image Super-Resolution#Set14 - 4x upscaling#PSNR
	Domain Adaptation#Office-31#Average Accuracy
	3D Object Detection#KITTI Cyclists Moderate#AP
	Reading Comprehension#RACE#Accuracy
	Panoptic Segmentation#Cityscapes val#PQst
	Scene Text Detection#SCUT-CTW1500#Precision
	Speech Separation#wsj0-2mix#SI-SDRi
	question_answering#SearchQA#Unigram Acc
	Panoptic Segmentation#Cityscapes val#PQth
	Self-Supervised Image Classification#ImageNet (finetuned)#Top 1 Accuracy
	Unsupervised Domain Adaptation#Market to Duke#rank-10
	Continuous Control#PyBullet HalfCheetah#Return
	language_modeling#Penn Treebank#Bit per Character (BPC)
	amr_parsing#LDC2014T12#F1 on Newswire
	Time Series Classification#JapaneseVowels#Accuracy
	Weakly-supervised 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
	Face Verification#IJB-C#TAR @ FAR=0.01
	3D Human Pose Estimation#3DPW#MPJPE
	Neural Architecture Search#ImageNet#Top-1 Error Rate
	Fine-Grained Image Classification#Birdsnap#Accuracy
	Fact-based Text Editing#WebEdit#ADD
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#SSIM
	Protein Secondary Structure Prediction#CB513#Q8
	3D Object Detection#KITTI Cars Moderate val#AP
	Action Recognition#UCF101#3-fold Accuracy
	Dense Object Detection#SKU-110K#AP
	Image Retrieval#Oxf105k#MAP
	Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (AV I)
	Sequential Image Classification#Sequential MNIST#Unpermuted Accuracy
	Node Classification#Coauthor CS#Accuracy
	Graph Classification#CIFAR10 100k#Accuracy (%)
	RGB-D Salient Object Detection#DES#Average MAE
	question_answering#SQuAD#F1
	question_answering#SQuAD#EM
	Image-to-Image Translation#Cityscapes Photo-to-Labels#Per-class Accuracy
	Video Object Detection#ImageNet VID#runtime (ms)
	Video Retrieval#MSR-VTT-1kA#text-to-video R@10
	Real-Time Object Detection#COCO#MAP
	Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Search time (s)
	Temporal Action Proposal Generation#ActivityNet-1.3#AUC (val)
	Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Restaurant (Acc)
	Time Series Classification#ArabicDigits#Accuracy
	Conditional Image Generation#ImageNet 128x128#Inception score
	Face Alignment#WFLW#AUC@0.1 (all)
	Image Classification#SVHN#Percentage error
	Semantic Textual Similarity#STS14#Spearman Correlation
	Multi-Person Pose Estimation#COCO test-dev#APL
	Multi-Person Pose Estimation#COCO test-dev#APM
	Neural Architecture Search#NAS-Bench-201, CIFAR-100#Accuracy (Test)
	3D Instance Segmentation#S3DIS#mRec
	Image Retrieval#In-Shop#R@1
	Photo geolocation estimation#Im2GPS#Continent level (2500 km)
	Graph Classification#MUTAG#Accuracy
	Recommendation Systems#MovieLens 100K#RMSE (u1 Splits)
	Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: general purpose
	Real-Time Object Detection#COCO#inference time (ms)
	3D Object Detection#KITTI Pedestrians Easy#AP
	Real-time Instance Segmentation#MSCOCO#mask AP
	Image Classification#MNIST#Accuracy
	Image Clustering#CIFAR-10#Train set
	Real-Time Object Detection#PASCAL VOC 2007#FPS
	Pedestrian Detection#CityPersons#Bare MR^-2
	Unsupervised Domain Adaptation#Duke to Market#rank-5
	Semantic Segmentation#Cityscapes val#mIoU
	Unsupervised Domain Adaptation#Duke to Market#rank-1
	RGB Salient Object Detection#HKU-IS#MAE
	Image Super-Resolution#Set5 - 4x upscaling#PSNR
	Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#FID
	Unsupervised Video Object Segmentation#DAVIS 2016#J&F
	Crowd Counting#WorldExpo’10#Average MAE
	Dense Object Detection#SKU-110K#AP75
	Face Alignment#AFLW2000-3D#Mean NME
	Generalized Zero-Shot Learning#SUN Attribute#Harmonic mean
	Real-Time Semantic Segmentation#CamVid#Time (ms)
	Emotion Recognition in Context#EMOTIC#mAP
	Few-Shot Image Classification#OMNIGLOT - 1-Shot, 20-way#Accuracy
	3D Human Pose Estimation#Human3.6M#Using 2D ground-truth joints
	Spoken language identification#LRE07#30 sec
	Recommendation Systems#MovieLens 20M#Recall@50
	Stochastic Optimization#CIFAR-10 WRN-28-10 - 200 Epochs#Accuracy
	Time Series Classification#PhysioNet Challenge 2012#AUC Stdev
	Node Classification#PubMed with Public Split: fixed 20 nodes per class#Accuracy
	summarization#DUC 2004 Task 1#ROUGE-L
	6D Pose Estimation using RGB#LineMOD#Accuracy (ADD)
	Person Search#CUHK-SYSU#Top-1
	dependency_parsing#benchmark Vietnamese dependency treebank VnDT#LAS
	3D Human Pose Estimation#MPI-INF-3DHP#3DPCK
	summarization#DUC 2004 Task 1#ROUGE-2
	summarization#DUC 2004 Task 1#ROUGE-1
	Node Classification#PubMed (0.05%)#Accuracy
	Link Prediction#WN18RR#Hits@10
	Visual Question Answering#VCR (QA-R) test#Accuracy
	Question Answering#Natural Questions (long)#F1
	Person Re-Identification#CUHK03 detected#MAP
	Atari Games#Atari 2600 Surround#Score
	RGB-D Salient Object Detection#SIP#max F-Measure
	Atari Games#Atari 2600 Boxing#Score
	Visual Question Answering#DocVQA test#ANLS
	Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Mean)
	Traffic Prediction#METR-LA#MAE @ 12 step
	Action Segmentation#GTEA#F1@25%
	Person Re-Identification#PRID2011#Rank-20
	Scene Text Detection#COCO-Text#F-Measure
	Atari Games#Atari 2600 Bank Heist#Score
	Node Classification#Cora (1%)#Accuracy
	Monocular 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
	Neural Network Compression#CIFAR-10#Size (MB)
	Object Counting#COCO count-test#mRMSE-nz
	Question Answering#SQuAD2.0#EM
	Facial Expression Recognition#FER2013#Accuracy
	Image Classification#STL-10#Percentage correct
	Question Answering#SQuAD2.0#F1
	Unsupervised Domain Adaptation#Market to MSMT#mAP
	machine_translation#The IWSLT 2015 Evaluation Campaign#BLEU
	Scene Text Detection#ICDAR 2015#F-Measure
	Text Classification#IMDb#Accuracy (2 classes)
	Facial Landmark Detection#300W#NME
	Unsupervised Domain Adaptation#Market to MSMT#rank-5
	Language Modelling#Text8#Number of params
	Unsupervised Domain Adaptation#Market to MSMT#rank-1
	Link Prediction#FB15k#Hits@1
	Node Classification#Texas#Accuracy
	Atari Games#Atari 2600 River Raid#Score
	Cross-View Image-to-Image Translation#Dayton (64×64) - aerial-to-ground#SSIM
	Link Prediction#FB15k#Hits@3
	Cross-Modal Retrieval#Flickr30k#Image-to-text R@10
	Supervised Video Summarization#TvSum#F1-score (Canonical)
	Few-Shot Image Classification#OMNIGLOT - 5-Shot, 5-way#Accuracy
	Sequential Image Classification#Sequential CIFAR-10#Unpermuted Accuracy
	Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Recall)
	Person Re-Identification#DukeMTMC-reID#Rank-1
	Cross-Modal Retrieval#COCO 2014#Text-to-image R@10
	Semantic Segmentation#Cityscapes test#Category mIoU
	Person Re-Identification#DukeMTMC-reID#Rank-5
	Image Super-Resolution#BSD100 - 2x upscaling#SSIM
	Word Sense Disambiguation#Words in Context#Accuracy
	Action Recognition#NTU RGB+D 120#Accuracy (Cross-Setup)
	Node Classification#Pubmed#Training Split
	Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.1-0.9)
	Layout-to-Image Generation#COCO-Stuff 64x64#Inception Score
	Atari Games#Atari 2600 Venture#Score
	Text Generation#MATH#Average Accuracy
	Grayscale Image Denoising#BSD68 sigma15#PSNR
	Visual Question Answering#VQA v2 test-std#other
	Question Answering#CoQA#Out-of-domain
	Semantic Textual Similarity#MRPC#Accuracy
	Human-Object Interaction Detection#HICO-DET#Time Per Frame (ms)
	Line Segment Detection#York Urban Dataset#sAP5
	Recommendation Systems#MovieLens 20M#nDCG@100
	Question Answering#RACE#RACE-h
	Question Answering#RACE#RACE-m
	Semantic Segmentation#Cityscapes test#Mean IoU (class)
	Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.1-0.5)
	Superpixel Image Classification#75 Superpixel MNIST#Classification Error
	Commonsense Reasoning for RL#commonsense-rl#Avg #Steps
	Time Series Classification#PhysioNet Challenge 2012#AUC
	Pose Transfer#Deep-Fashion#SSIM
	Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Decay)
	Image-to-Image Translation#Cityscapes Photo-to-Labels#Per-pixel Accuracy
	text_classification#TREC#Error
	Medical Image Segmentation#Kvasir-SEG#Average MAE
	Speech Enhancement#CHiME-3#SDR
	Head Pose Estimation#AFLW2000#MAE
	Gesture-to-Gesture Translation#Senz3D#IS
	Visual Question Answering#GQA Test2019#Plausibility
	3D Object Detection#KITTI Cars Easy#AP
	Image Clustering#MNIST-test#Accuracy
	Time Series Classification#UWave#Accuracy
	Visual Dialog#Visual Dialog v1.0 test-std#MRR (x 100)
	Image-to-Image Translation#Cityscapes Photo-to-Labels#Class IOU
	Task-Oriented Dialogue Systems#KVRET#BLEU
	word_sense_disambiguation#SemEval 2015#F1
	Image Relighting#VIDIT’20 validation set#LPIPS
	Weakly-supervised 3D Human Pose Estimation#Human3.6M#Number of Views
	JPEG Artifact Correction#ICB (Quality 10 Grayscale)#PSNR-B
	Image Classification#ImageNet#Top 5 Accuracy
	Image Clustering#CIFAR-10#Accuracy
	Atari Games#Atari 2600 Up and Down#Score
	Depth Estimation#NYU-Depth V2#RMS
	Person Re-Identification#DukeMTMC-reID#MAP
	Image Super-Resolution#WebFace - 8x upscaling#PSNR
	Graph Classification#NCI1#Accuracy
	Deblurring#GoPro#SSIM
	Hate Speech Detection#HateXplain#Macro F1
	Visual Question Answering#GQA Test2019#Validity
	machine_translation#WMT 2014 EN-DE#BLEU
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#LPIPS
	Visual Dialog#VisDial v0.9 val#MRR
	Keyword Spotting#Google Speech Commands#Google Speech Commands V2 12
	Grammatical Error Detection#FCE#F0.5
	Facial Expression Recognition#AffectNet#Accuracy (7 emotion)
	Emotion Recognition in Conversation#IEMOCAP#F1
	Link Prediction#FB15k#Hits@10
	JPEG Artifact Correction#ICB (Quality 10 Grayscale)#PSNR
	Semi-Supervised Image Classification#CIFAR-10, 1000 Labels#Accuracy
	Relation Extraction#NYT#F1
	Semi-Supervised Semantic Segmentation#Pascal VOC 2012 12.5% labeled#Validation mIoU
	Scene Text Detection#COCO-Text#Precision
	Keyword Spotting#Google Speech Commands#Google Speech Commands V2 35
	Weakly Supervised Action Localization#THUMOS’14#mAP@0.5
	Object Detection#COCO test-dev#box AP
	Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: domain specific
	Image Super-Resolution#BSD100 - 4x upscaling#PSNR
	Atari Games#Atari 2600 Name This Game#Score
	Relation Extraction#ACE 2005#NER Micro F1
	Data-to-Text Generation#LDC2017T10#BLEU
	Self-Supervised Action Recognition#UCF101#Pre-Training Dataset
	Pose Estimation#COCO test-dev#AR
	Pose Estimation#COCO test-dev#AP
	Graph Classification#NEURON-MULTI#Accuracy
	Relation Extraction#ACE 2005#Sentence Encoder
	Image Generation#ImageNet 32x32#bpd
	relation_prediction#FB15K-237#MRR
	Action Recognition#HMDB-51#Average accuracy of 3 splits
	Action Recognition#AVA v2.2#mAP
	ccg_supertagging#CCGBank#Accuracy
	Data-to-Text Generation#E2E NLG Challenge#BLEU
	Atari Games#Atari 2600 Star Gunner#Score
	Visual Question Answering#VCR (Q-A) test#Accuracy
	Scene Text Detection#SCUT-CTW1500#F-Measure
	Video Semantic Segmentation#Cityscapes val#mIoU
	Action Recognition#Something-Something V1#Top 1 Accuracy
	Link Prediction#FB15k-237#Hits@3
	Link Prediction#FB15k-237#Hits@1
	Text Classification#Yahoo! Answers#Accuracy
	Partial Domain Adaptation#Office-Home#Accuracy (%)
	6D Pose Estimation using RGB#Occlusion LineMOD#Mean ADD
	Image Generation#CIFAR-10#bits/dimension
	Graph Regression#ZINC-500k#MAE
	Intent Detection#ATIS#F1
	Human Part Segmentation#PASCAL-Part#mIoU
	relation_prediction#WN18RR#H@10
	Image Retrieval with Multi-Modal Query#MIT-States#Recall@10
	Intent Detection#SNIPS#Slot F1 Score
	taxonomy_learning#SemEval 2018#P@5
	Video Instance Segmentation#YouTube-VIS validation#mask AP
	Face Detection#WIDER Face (Hard)#AP
	Image-to-Image Translation#ADE20K-Outdoor Labels-to-Photos#mIoU
	Scene Text Detection#ICDAR 2013#Recall
	Unsupervised Person Re-Identification#Market-1501#Rank-1
	dependency_parsing#Penn Treebank#POS
	question_answering#CNN / Daily Mail#Accuracy on CNN
	Optical Flow Estimation#KITTI 2015#Fl-all
	Semantic Segmentation#PASCAL VOC 2012 val#mIoU
	Named Entity Recognition#CoNLL++#F1
	Question Answering#bAbi#Accuracy (trained on 1k)
	Time Series Classification#Libras#NLL
	Dense Pixel Correspondence Estimation#HPatches#Viewpoint II AEPE
	Image Clustering#MNIST-full#NMI
	Machine Translation#WMT2015 English-German#BLEU score
	3D Face Reconstruction#NoW Benchmark#Mean Reconstruction Error (mm)
	Semantic Segmentation#PASCAL VOC 2012 test#Mean IoU
	Relation Extraction#CoNLL04#RE+ Macro F1
	Pose Estimation#UPenn Action#Mean PCK@0.2
	Conversational Response Selection#DSTC7 Ubuntu#1-of-100 Accuracy
	Image Classification#WebVision-1000#Top-1 Accuracy
	Atari Games#Atari 2600 Yars Revenge#Score
	JPEG Artifact Correction#ICB (Quality 10 Color)#PSNR-B
	Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.5
	Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Mean)
	Image Super-Resolution#Urban100 - 2x upscaling#SSIM
	Visual Question Answering#GQA Test2019#Open
	Single Image Deraining#Rain100L#SSIM
	Entity Linking#WiC-TSV#Task 3 Accuracy: general purpose
	Scene Text Detection#MSRA-TD500#F-Measure
	Mortality Prediction#MIMIC-III#F1 score
	Video Retrieval#MSR-VTT-1kA#text-to-video Mean Rank
	Node Classification#Actor#Accuracy
	language_modeling#Penn Treebank#Test perplexity
	Gesture-to-Gesture Translation#Senz3D#PSNR
	Image Generation#CLEVR#FID-5k-training-steps
	Self-Supervised Image Classification#ImageNet#Top 1 Accuracy (kNN, k=20)
	Fine-Grained Image Classification#CUB-200-2011#Accuracy
	Lung Nodule Classification#LIDC-IDRI#Accuracy
	Link Prediction#Pubmed#AP
	Pedestrian Detection#CityPersons#Reasonable MR^-2
	Link Prediction#WN18#MRR
	Face Identification#MegaFace#Accuracy
	Domain Adaptation#VisDA2017#Accuracy
	Face Verification#MegaFace#Accuracy
	Question Answering#YahooCQA#MRR
	Scene Text Detection#COCO-Text#Recall
	Video Frame Interpolation#Vimeo90k#PSNR
	RGB Salient Object Detection#DUT-OMRON#MAE
	Image Retrieval with Multi-Modal Query#MIT-States#Recall@5
	Image Retrieval with Multi-Modal Query#MIT-States#Recall@1
	Gesture-to-Gesture Translation#NTU Hand Digit#PSNR
	Image Retrieval#SOP#R@1
	Multi-Label Classification#MS-COCO#mAP
	Keyword Spotting#Google Speech Commands#Google Speech Commands V1 12
	3D Human Pose Estimation#MPI-INF-3DHP#AUC
	Lipreading#CAS-VSR-W1k (LRW-1000)#Top-1 Accuracy
	Weakly-Supervised Semantic Segmentation#PASCAL VOC 2012 val#Mean IoU
	Machine Translation#WMT2016 German-English#BLEU score
	Video Retrieval#MSR-VTT#video-to-text R@5
	Visual Question Answering#MSRVTT-QA#Accuracy
	Domain Generalization#ImageNet-A#Top-1 accuracy %
	Action Recognition#Jester#Val
	Image Super-Resolution#Set5 - 8x upscaling#PSNR
	Semi-Supervised Image Classification#STL-10, 1000 Labels#Accuracy
	Image Super-Resolution#Manga109 - 8x upscaling#PSNR
	Visual Question Answering#VQA v2 test-std#overall
	RGB-D Salient Object Detection#DES#max F-Measure
	Image Clustering#Fashion-MNIST#Accuracy
	Semantic Segmentation#PASCAL Context#mIoU
	Semantic Similarity#SICK#MSE
	Retinal Vessel Segmentation#STARE#F1 score
	Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#FID
	Machine Translation#WMT2014 English-German#BLEU score
	3D Object Detection#KITTI Cars Hard val#AP
	Image Super-Resolution#Urban100 - 4x upscaling#PSNR
	3D Human Pose Estimation#Human3.6M#Multi-View or Monocular
	Relation Extraction#CoNLL04#NER Macro F1
	Image Super-Resolution#BSD100 - 4x upscaling#MOS
	Semi-Supervised Image Classification#ImageNet - 1% labeled data#Top 5 Accuracy
	Weakly-Supervised Semantic Segmentation#PASCAL VOC 2012 test#Mean IoU
	Node Classification#PATTERN 100k#Accuracy (%)
	Node Classification#MAG240M-LSC#Validation Accuracy
	Image Generation#FFHQ#FID-10k-training-steps
	relation_prediction#WN18RR#MRR
	Fine-Grained Image Classification#DF20#Top-1
	Fine-Grained Image Classification#DF20#Top-3
	Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: all
	3D Multi-Person Pose Estimation (root-relative)#MuPoTS-3D#3DPCK
	Medical Image Segmentation#Kvasir-SEG#mean Dice
	Video Retrieval#MSR-VTT#text-to-video R@1
	RGB-D Salient Object Detection#LFSD#S-Measure
	Semantic Textual Similarity#STS16#Spearman Correlation
	RGB-D Salient Object Detection#STERE#max F-Measure
	Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Recall)
	Sentiment Analysis#TweetEval#Emotion
	Neural Architecture Search#CIFAR-10#FLOPS
	Atari Games#Atari 2600 Kangaroo#Score
	Lane Detection#TuSimple#F1 score
	Session-Based Recommendations#Diginetica#Hit@20
	Atari Games#Atari 2600 Seaquest#Score
	Neural Architecture Search#NAS-Bench-201, CIFAR-10#Search time (s)
	Graph Classification#PROTEINS#Accuracy
	Common Sense Reasoning#SWAG#Test
	Multi-Object Tracking#MOT16#MOTA
	Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Decay)
	Visual Question Answering#VQA v2 test-std#number
	Object Detection#COCO minival#APL
	Object Detection#COCO minival#APM
	Object Detection#COCO minival#APS
	Atari Games#Atari 2600 Krull#Score
	JPEG Artifact Correction#LIVE1 (Quality 10 Color)#PSNR
	Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-German#Accuracy
	RGB-D Salient Object Detection#DES#max E-Measure
	Node Classification#PubMed (0.1%)#Accuracy
	Link Prediction#WN18#MR
	Semi-Supervised Image Classification#CIFAR-10, 40 Labels#Percentage error
	Scene Text Detection#ICDAR 2013#F-Measure
	Image Super-Resolution#Set5 - 2x upscaling#SSIM
	Transfer Learning#Office-Home#Accuracy
	JPEG Artifact Correction#ICB (Quality 20 Color)#PSNR-B
	Image Classification#smallNORB#Classification Error
	Image Super-Resolution#Manga109 - 2x upscaling#SSIM
	Object Detection#USB (Standard USB 1.0 protocol)#mCAP
	Deblurring#RealBlur-J (trained on GoPro)#PSNR (sRGB)
	JPEG Artifact Correction#ICB (Quality 20 Grayscale)#PSNR-B
	Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Mean Acc (Restaurant + Laptop)
	Node Classification#Chameleon#Accuracy
	Question Answering#CoQA#Overall
	Visual Object Tracking#VOT2017/18#Expected Average Overlap (EAO)
	Hate Speech Detection#HateXplain#AUROC
	Node Classification#CiteSeer (0.5%)#Accuracy
	Age-Invariant Face Recognition#CACDVS#Accuracy
	Layout-to-Image Generation#COCO-Stuff 64x64#FID
	Image Clustering#STL-10#NMI
	JPEG Artifact Correction#ICB (Quality 20 Grayscale)#SSIM
	Graph Classification#D&D#Accuracy
	Text Summarization#GigaWord#ROUGE-L
	RGB Salient Object Detection#DUTS-TE#MAE
	Natural Language Inference#SNLI#% Test Accuracy
	Text Summarization#GigaWord#ROUGE-1
	Text Summarization#GigaWord#ROUGE-2
	Unsupervised Domain Adaptation#Market to MSMT#rank-10
	Surgical tool detection#Cholec80#mAP
	RGB-D Salient Object Detection#NLPR#S-Measure
	Semantic Textual Similarity#STS15#Spearman Correlation
	Named Entity Recognition#Ontonotes v5 (English)#F1
	Unsupervised Domain Adaptation#Market to Duke#rank-1
	Heterogeneous Node Classification#DBLP (PACT) 14k#Micro-F1 (20% training data)
	Unsupervised Domain Adaptation#Market to Duke#rank-5
	Atari Games#Atari 2600 Berzerk#Score
	Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#LLE
	Image Classification#ImageNet#Number of params
	Face Detection#WIDER Face (Easy)#AP
	Action Classification#Kinetics-600#Top-1 Accuracy
	Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#SSIM
	question_answering#Quasar#F1 (Quasar-T)
	Visual Object Tracking#OTB-2015#AUC
	Text Simplification#Newsela#SARI
	Action Classification#Kinetics-700#Top-5 Accuracy
	Language Modelling#Text8#Bit per Character (BPC)
	Image Super-Resolution#Urban100 - 8x upscaling#PSNR
	Out-of-Distribution Detection#STL-10#Percentage correct
	Dense Pixel Correspondence Estimation#HPatches#Viewpoint I AEPE
	Object Detection#COCO minival#AP50
	Semi-Supervised Semantic Segmentation#Pascal VOC 2012 5% labeled#Validation mIoU
	Node Classification#Cora#Accuracy
	Aesthetics Quality Assessment#AVA#Accuracy
	Named Entity Recognition#ACE 2005#F1
	Instance Segmentation#COCO test-dev#APS
	taxonomy_learning#SemEval 2018#MRR
	Fake News Detection#FNC-1#Per-class Accuracy (Disagree)
	Instance Segmentation#COCO test-dev#APM
	Instance Segmentation#COCO test-dev#APL
	Entity Alignment#DBP15k zh-en#Hits@1
	Object Detection#COCO minival#AP75
	language_modeling#1B Words / Google Billion Word benchmark#Number of params
	Action Segmentation#GTEA#F1@50%
	Action Classification#Moments in Time#Top 5 Accuracy
	Question Answering#Children's Book Test#Accuracy-NE
	Cross-Modal Retrieval#COCO 2014#Image-to-text R@1
	Action Recognition#Sports-1M#Video hit@1
	Action Recognition#Sports-1M#Video hit@5
	Time Series Classification#PEMS#Accuracy
	Real-Time Semantic Segmentation#NYU Depth v2#Speed(ms/f)
	Cross-Modal Retrieval#COCO 2014#Image-to-text R@5
	Word Sense Disambiguation#Supervised:#Senseval 3
	Word Sense Disambiguation#Supervised:#Senseval 2
	Image-to-Image Translation#Cityscapes Labels-to-Photo#Per-class Accuracy
	Image Super-Resolution#Manga109 - 4x upscaling#PSNR
	Retinal Vessel Segmentation#CHASE_DB1#AUC
	Atari Games#Atari 2600 Frostbite#Score
	Vision and Language Navigation#VLN Challenge#oracle success
	Relation Extraction#WebNLG#F1
	Drug Discovery#Tox21#AUC
	Image Generation#FFHQ 256 x 256#FID
	Question Answering#TriviaQA#F1
	Semi-Supervised Semantic Segmentation#Pascal VOC 2012 2% labeled#Validation mIoU
	Semantic Textual Similarity#STS12#Spearman Correlation
	Fine-Grained Image Classification#DF20#F1 - macro
	Few-Shot Image Classification#FC100 5-way (1-shot)#Accuracy
	Speech Recognition#swb_hub_500 WER fullSWBCH#Percentage error
	Speech Recognition#MediaSpeech#WER for French
	Image Classification#EMNIST-Letters#Accuracy
	Time Series Classification#NetFlow#Accuracy
	Text Style Transfer#Yelp Review Dataset (Small)#G-Score (BLEU, Accuracy)
	Self-Supervised Action Recognition#HMDB51#Top-1 Accuracy
	Semantic Textual Similarity#STS13#Spearman Correlation
	Link Prediction#Cora#AP
	Relation Extraction#SemEval-2010 Task 8#F1
	Incremental Learning#CIFAR-100 - 50 classes + 5 steps of 10 classes#Average Incremental Accuracy
	Cross-View Image-to-Image Translation#cvusa#SSIM
	Speech Recognition#MediaSpeech#WER for Arabic
	Person Search#PRW#Top-1
	Image Clustering#CIFAR-100#NMI
	Face Verification#YouTube Faces DB#Accuracy
	Named Entity Recognition#CoNLL 2002 (Dutch)#F1
	Image Super-Resolution#VggFace2 - 8x upscaling#PSNR
	Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Recall
	Synthetic-to-Real Translation#GTAV-to-Cityscapes Labels#mIoU
	Fine-Grained Image Classification#Oxford-IIIT Pets#Accuracy
	Image Classification#Fashion-MNIST#Percentage error
	Question Answering#Children's Book Test#Accuracy-CN
	Action Recognition#Something-Something V2#Top-5 Accuracy
	Atari Games#Atari 2600 Fishing Derby#Score
	Question Answering#NarrativeQA#BLEU-4
	Question Answering#NarrativeQA#BLEU-1
	Text Classification#20NEWS#Accuracy
	Image Denoising#DND#PSNR (sRGB)
	Visual Object Tracking#VOT2016#Expected Average Overlap (EAO)
	Semi-Supervised Image Classification#SVHN, 500 Labels#Accuracy
	sentiment_analysis#IMDb#Accuracy
	Unsupervised Person Re-Identification#DukeMTMC-reID#Rank-10
	Nested Mention Recognition#ACE 2005#F1
	Domain Adaptation#SVHN-to-MNIST#Accuracy
	Object Detection#COCO minival#box AP
	Action Recognition#EPIC-KITCHENS-100#GFLOPs
	Music Transcription#MusicNet#APS
	Semi-Supervised Image Classification#CIFAR-10, 4000 Labels#Accuracy
	Hate Speech Detection#Ethos MultiLabel#Hamming Loss
	Action Classification#Kinetics-600#GFLOPs
	Semi-Supervised Semantic Segmentation#Cityscapes 25% labeled#Validation mIoU
	Face Alignment#300W#Fullset (public)
	unknown