Classification

  • Accuracy
  • Precision
  • Recall
  • F1-score
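
All four metrics above can be derived from the counts of true/false positives and negatives. The following is a minimal sketch for the binary case, assuming labels are given as plain Python lists; the function name `classification_metrics` and the `positive` parameter are illustrative, not taken from any particular library.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall, and F1 for binary labels (illustrative helper)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # false negatives
    tn = len(pairs) - tp - fp - fn                               # true negatives

    accuracy = (tp + tn) / len(pairs)
    # Assumption: an undefined ratio (zero denominator) is reported as 0.0.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


metrics = classification_metrics([1, 0, 1, 1, 0, 1], [1, 0, 0, 1, 1, 1])
print(metrics)
```

On imbalanced data, accuracy can look high while precision or recall is poor, which is why the four are usually reported together.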

Regression

  • Mean Squared Error (MSE)
  • Mean Absolute Error (MAE)
  • R-squared (R^2)
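
The three regression metrics above all come from the residuals (true value minus prediction). A minimal sketch, assuming plain Python lists of floats; `regression_metrics` is a hypothetical helper name.

```python
def regression_metrics(y_true, y_pred):
    """MSE, MAE, and R^2 from paired true/predicted values (illustrative helper)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]

    mse = sum(r * r for r in residuals) / n
    mae = sum(abs(r) for r in residuals) / n

    # R^2 = 1 - SS_res / SS_tot, where SS_tot measures spread around the mean.
    mean_true = sum(y_true) / n
    ss_res = sum(r * r for r in residuals)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    # Assumption: R^2 is reported as NaN when the targets are constant.
    r2 = 1.0 - ss_res / ss_tot if ss_tot else float("nan")

    return {"mse": mse, "mae": mae, "r2": r2}


print(regression_metrics([3.0, -0.5, 2.0, 7.0], [2.5, 0.0, 2.0, 8.0]))
```

MSE penalizes large errors more heavily than MAE because the residuals are squared, so the two can rank models differently when outliers are present.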

Ranking

  • Mean Reciprocal Rank (MRR)
  • Normalized Discounted Cumulative Gain (NDCG)
  • Mean Average Precision (MAP)
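
As a rough illustration of two of the ranking metrics above, the sketch below computes MRR and NDCG from relevance labels listed in ranked order. It assumes the linear-gain form of DCG (relevance divided by log2 of rank + 1); other formulations use an exponential gain of 2^rel - 1. The function names are illustrative.

```python
import math


def mean_reciprocal_rank(ranked_relevance):
    """Average of 1/rank of the first relevant item per query (0 if none)."""
    total = 0.0
    for rels in ranked_relevance:
        for rank, rel in enumerate(rels, start=1):
            if rel:
                total += 1.0 / rank
                break
    return total / len(ranked_relevance)


def dcg(rels, k=None):
    """Discounted cumulative gain with linear gain, truncated at k if given."""
    rels = rels[:k] if k is not None else rels
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(rels))


def ndcg(rels, k=None):
    """DCG divided by the DCG of the same labels sorted in ideal order."""
    ideal = dcg(sorted(rels, reverse=True), k)
    return dcg(rels, k) / ideal if ideal > 0 else 0.0


print(mean_reciprocal_rank([[0, 1, 0], [1, 0, 0], [0, 0, 0]]))
print(ndcg([3, 2, 3, 0, 1, 2]))
```

Here the ideal ordering is built only from the labels of the items that were actually returned; an evaluation that knows about relevant items missing from the list would use those in the ideal ranking as well.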

Clustering

  • Silhouette Score
  • Calinski-Harabasz Index
  • Adjusted Rand Index

Segmentation

  • Intersection over Union (IoU)
  • Dice Coefficient
  • Pixel Accuracy
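
The three segmentation metrics above compare a predicted mask with a ground-truth mask pixel by pixel. A minimal sketch for binary masks, assuming they are given as equally sized nested lists of 0/1 values; `segmentation_metrics` is a hypothetical helper name.

```python
def segmentation_metrics(pred_mask, true_mask):
    """IoU, Dice, and pixel accuracy for binary masks (illustrative helper)."""
    pred = [p for row in pred_mask for p in row]  # flatten row by row
    true = [t for row in true_mask for t in row]
    if len(pred) != len(true) or len(pred) == 0:
        raise ValueError("masks must be non-empty and of equal size")

    intersection = sum(1 for p, t in zip(pred, true) if p == 1 and t == 1)
    pred_area = sum(1 for p in pred if p == 1)
    true_area = sum(1 for t in true if t == 1)
    union = pred_area + true_area - intersection

    # Assumption: two empty masks count as a perfect match (score 1.0).
    iou = intersection / union if union else 1.0
    dice = (2 * intersection / (pred_area + true_area)
            if (pred_area + true_area) else 1.0)
    pixel_accuracy = sum(1 for p, t in zip(pred, true) if p == t) / len(pred)

    return {"iou": iou, "dice": dice, "pixel_accuracy": pixel_accuracy}


pred = [[1, 1, 0],
        [0, 1, 0]]
true = [[1, 0, 0],
        [0, 1, 1]]
print(segmentation_metrics(pred, true))
```

For a single pair of binary masks, Dice and IoU are monotonically related (Dice = 2·IoU / (1 + IoU)), while pixel accuracy also rewards correctly predicted background and can therefore be misleading when the foreground is small.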

Object Detection

  • Precision
  • Recall
  • Mean Average Precision (mAP)

Generative Models

  • Inception Score
  • Fréchet Inception Distance (FID)
  • Kernel Inception Distance (KID)

Computer Vision (Image Quality)

  • Peak Signal-to-Noise Ratio (PSNR)
  • Structural Similarity Index (SSIM)
  • Visual Information Fidelity (VIF)

Natural Language Processing

  • BLEU (Bilingual Evaluation Understudy)
  • ROUGE (Recall-Oriented Understudy for Gisting Evaluation)
  • Perplexity
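
Of the three metrics above, perplexity is the simplest to state: it is the exponential of the average negative log-probability the model assigns to each token. A minimal sketch, assuming the per-token probabilities are already available as a list of floats in (0, 1]; the function name is illustrative.

```python
import math


def perplexity(token_probs):
    """exp of the mean negative natural-log probability per token."""
    if not token_probs:
        raise ValueError("need at least one token probability")
    if any(p <= 0.0 or p > 1.0 for p in token_probs):
        raise ValueError("probabilities must lie in (0, 1]")
    mean_neg_log = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(mean_neg_log)


# A model that assigns probability 0.25 to every token is as uncertain
# as a uniform choice among four options.
print(perplexity([0.25, 0.25, 0.25, 0.25]))
```

Lower perplexity means the model assigned higher probability to the observed text. Values are only comparable between models that share the same tokenization.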

Recommendation Systems

  • Hit Rate
  • Average Reciprocal Hit-Rank (ARHR)
  • Normalized Discounted Cumulative Gain (NDCG)

Anomaly/Outlier Detection

  • Area Under the ROC Curve (AU-ROC)
  • Area Under the Precision-Recall Curve (AU-PRC)
  • F1-score
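
AU-ROC can be read as the probability that a randomly chosen anomaly receives a higher score than a randomly chosen normal point. The sketch below computes it directly from that pairwise definition, assuming labels of 1 (anomaly) and 0 (normal) and scores where higher means more anomalous; it runs in quadratic time and is meant only as an illustration, and the function name is hypothetical.

```python
def auroc(labels, scores):
    """Area under the ROC curve via pairwise comparison (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # anomaly ranked above the normal point
            elif p == n:
                wins += 0.5   # tie: counted as half a win
    return wins / (len(pos) * len(neg))


print(auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

Because anomalies are typically rare, AU-ROC can remain high even when most flagged points are false alarms, which is one reason the precision-recall curve is often reported alongside it.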

Survival Analysis

  • Concordance Index
  • Brier Score
  • Integrated Discrimination Improvement (IDI)

Reinforcement Learning

  • Total Reward
  • Average Return
  • Win Rate