Classification
- Accuracy
- Precision
- Recall
- F1-score
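
The four classification metrics above can all be derived from the same counts of true positives, false positives, and false negatives. A minimal sketch for the binary case follows; the function name, the `positive` parameter, and the choice to return 0.0 when a denominator is empty are assumptions made for illustration.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Binary accuracy, precision, recall, and F1 from two label lists."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)

    accuracy = correct / len(pairs)
    # Assumption: an undefined ratio (empty denominator) is reported as 0.0.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}
```
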
Regression
- Mean Squared Error (MSE)
- Mean Absolute Error (MAE)
- R-squared (R^2)
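
As a rough illustration of how the three regression metrics relate, the sketch below computes them from the same residuals. The function name is hypothetical, and returning NaN for R^2 when the targets are constant is an assumption, since the ratio is undefined in that case.

```python
def regression_metrics(y_true, y_pred):
    """MSE, MAE, and R^2 for two equal-length lists of numbers."""
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]

    mse = sum(e * e for e in residuals) / n
    mae = sum(abs(e) for e in residuals) / n

    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in residuals)              # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    # Assumption: R^2 is reported as NaN when the targets have zero variance.
    r2 = 1.0 - ss_res / ss_tot if ss_tot else float("nan")
    return {"mse": mse, "mae": mae, "r2": r2}
```
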
Ranking
- Mean Reciprocal Rank (MRR)
- Normalized Discounted Cumulative Gain (NDCG)
- Mean Average Precision (MAP)
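
Two of the ranking metrics above are short enough to sketch directly. The input layout (one relevance list per query, already in ranked order) and the function names are assumptions. The NDCG here uses the linear-gain form; an exponential-gain variant (2^gain - 1) is also in common use and gives different values for graded relevance.

```python
import math

def mean_reciprocal_rank(ranked_relevance):
    """ranked_relevance: one list of 0/1 flags per query, in ranked order."""
    total = 0.0
    for flags in ranked_relevance:
        for rank, relevant in enumerate(flags, start=1):
            if relevant:
                total += 1.0 / rank  # only the first relevant item counts
                break
    return total / len(ranked_relevance)

def dcg(gains):
    """Discounted cumulative gain with a log2(rank + 1) discount."""
    return sum(g / math.log2(rank + 1) for rank, g in enumerate(gains, start=1))

def ndcg(gains):
    """DCG divided by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(gains, reverse=True))
    return dcg(gains) / ideal if ideal else 0.0
```
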
Clustering
- Silhouette Score
- Calinski-Harabasz Index
- Adjusted Rand Index
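
Of the clustering metrics above, the Adjusted Rand Index is the one that compares a clustering against reference labels, and it can be sketched with pair counts alone. The function name is hypothetical, and returning 1.0 in the degenerate case where the denominator is zero is an assumption.

```python
from collections import Counter
from math import comb

def adjusted_rand_index(labels_true, labels_pred):
    """Chance-corrected agreement between two labelings of the same items."""
    n = len(labels_true)
    contingency = Counter(zip(labels_true, labels_pred))
    rows = Counter(labels_true)
    cols = Counter(labels_pred)

    # Number of item pairs that share a cluster, per cell / row / column.
    sum_cells = sum(comb(c, 2) for c in contingency.values())
    sum_rows = sum(comb(c, 2) for c in rows.values())
    sum_cols = sum(comb(c, 2) for c in cols.values())

    total_pairs = comb(n, 2)
    expected = sum_rows * sum_cols / total_pairs if total_pairs else 0.0
    max_index = 0.5 * (sum_rows + sum_cols)
    if max_index == expected:
        return 1.0  # assumption: treat the degenerate case as perfect agreement
    return (sum_cells - expected) / (max_index - expected)
```

Because the index depends only on which items are grouped together, renaming the cluster labels does not change the score.
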
Segmentation
- Intersection over Union (IoU)
- Dice Coefficient
- Pixel Accuracy
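
For binary masks, the three segmentation metrics above reduce to counting overlapping pixels. The sketch below assumes masks are flattened into equal-length sequences of 0/1 values; the function name and the convention of scoring two empty masks as 1.0 are assumptions.

```python
def mask_metrics(pred, target):
    """IoU, Dice, and pixel accuracy for two flattened binary masks."""
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection

    # Assumption: two empty masks count as a perfect match.
    iou = intersection / union if union else 1.0
    total_area = pred_area + target_area
    dice = 2 * intersection / total_area if total_area else 1.0
    matches = sum(1 for p, t in zip(pred, target) if bool(p) == bool(t))
    pixel_accuracy = matches / len(target)
    return {"iou": iou, "dice": dice, "pixel_accuracy": pixel_accuracy}
```
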
Object Detection
- Precision
- Recall
- Mean Average Precision (mAP)
Generative Models
- Inception Score (IS)
- Fréchet Inception Distance (FID)
- Kernel Inception Distance (KID)
Computer Vision (Image Quality)
- Peak Signal-to-Noise Ratio (PSNR)
- Structural Similarity Index (SSIM)
- Visual Information Fidelity (VIF)
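
PSNR is the simplest of the three metrics above, since it is a log-scaled form of the mean squared error between a reference and a distorted image. A minimal sketch, assuming flattened pixel sequences and an 8-bit peak value of 255 (both assumptions, as is the function name):

```python
import math

def psnr(reference, distorted, max_value=255.0):
    """Peak signal-to-noise ratio in decibels for two flattened images."""
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: no noise to measure
    return 10.0 * math.log10(max_value ** 2 / mse)
```
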
Natural Language Processing
- BLEU (Bilingual Evaluation Understudy)
- ROUGE (Recall-Oriented Understudy for Gisting Evaluation)
- Perplexity
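
Perplexity is the exponential of the average negative log-likelihood per token. The sketch below assumes the model's probability for each observed token is already available as a list of values in (0, 1]; the function name is hypothetical.

```python
import math

def perplexity(token_probs):
    """Perplexity from the probabilities assigned to each observed token."""
    avg_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_nll)
```

A model that assigns probability 1/4 to every token has perplexity 4, which is why the value is often read as an effective branching factor.
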
Recommendation Systems
- Hit Rate
- Average Reciprocal Hit-Rank (ARHR)
- Normalized Discounted Cumulative Gain (NDCG)
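
Hit Rate and ARHR are commonly computed under a leave-one-out protocol: one item per user is held out, and the recommender's top-k list is checked for it. The sketch below assumes that setup; the function name, argument layout, and default k are assumptions.

```python
def hit_rate_and_arhr(recommendations, held_out, k=10):
    """recommendations: one ranked item list per user.
    held_out: the single withheld item for each user, in the same order."""
    hits = 0
    reciprocal_rank_sum = 0.0
    for ranked_items, item in zip(recommendations, held_out):
        top_k = ranked_items[:k]
        if item in top_k:
            hits += 1
            reciprocal_rank_sum += 1.0 / (top_k.index(item) + 1)
    n_users = len(held_out)
    return hits / n_users, reciprocal_rank_sum / n_users
```
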
Anomaly/Outlier Detection
- Area Under the ROC Curve (AUROC)
- Area Under the Precision-Recall Curve (AUPRC)
- F1-score
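
The area under the ROC curve equals the probability that a randomly chosen anomaly receives a higher score than a randomly chosen normal point, with ties counted as one half. The sketch below uses that pairwise definition directly. It is quadratic in the number of samples, so it is meant for illustration rather than large datasets, and the function name and the 1/0 label convention are assumptions.

```python
def auroc(labels, scores):
    """Pairwise AUROC; labels are 1 for anomalies and 0 for normal points."""
    positives = [s for l, s in zip(labels, scores) if l == 1]
    negatives = [s for l, s in zip(labels, scores) if l == 0]
    if not positives or not negatives:
        raise ValueError("AUROC needs at least one sample of each class")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties contribute half a win
    return wins / (len(positives) * len(negatives))
```
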
Survival Analysis
- Concordance Index
- Brier Score
- Integrated Discrimination Improvement (IDI)
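
The Concordance Index can be sketched in the style of Harrell's C: a pair of subjects is comparable when the one with the shorter observed time experienced the event, and concordant when that subject also has the higher predicted risk. The function name and argument layout are assumptions, and pairs with tied times are simply skipped here for brevity.

```python
def concordance_index(times, events, risk_scores):
    """times: observed times; events: 1 if the event occurred, 0 if censored;
    risk_scores: higher means the event is predicted to happen sooner."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the earlier one in a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count as half
    return concordant / comparable if comparable else float("nan")
```
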
Reinforcement Learning
- Total Reward
- Average Return
- Win Rate
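
The reinforcement learning metrics above are summaries over logged episodes. The sketch below assumes each episode is a list of per-step rewards. The function names are hypothetical, and defining a "win" as a return above a threshold is an assumption; many environments report wins through an explicit outcome flag instead.

```python
def episode_return(rewards, gamma=1.0):
    """Total (optionally discounted) reward of one episode."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total

def summarize_episodes(episodes, gamma=1.0, win_threshold=0.0):
    """Average return and win rate over a list of reward sequences."""
    returns = [episode_return(ep, gamma) for ep in episodes]
    average_return = sum(returns) / len(returns)
    # Assumption: an episode counts as a win if its return exceeds the threshold.
    win_rate = sum(1 for g in returns if g > win_threshold) / len(returns)
    return {"average_return": average_return, "win_rate": win_rate}
```
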