Enhanced Evaluation Metrics with Macro/Micro Scores and Per-Label Analysis by Xiaomin-HUANG · Pull Request #304 · urchade/GLiNER

Xiaomin-HUANG · 2025-11-13T16:31:02Z

Added Macro F1 and per-label metrics to complement the existing Micro F1 score, enabling better visibility into model performance across different entity types.

What's New

Macro F1: Unweighted average across all labels (0.51)
Per-label breakdown: Precision, Recall, and F1 for each entity type
Formatted table: Sorted output for quick identification of best/worst performers

Enhenced output : {
        
        "per_class":{"tag1":{"precision":float, "recall":float,"f_score":float},
                    "tag2":{}...
                    },
        "micro":{"precision":float, "recall":float,"f_score":float},
        "macro":{"precision":float, "recall":float,"f_score":float},
        }

Formatted table eg :

…cores per label

Improve evaluation output by add more info : Macro & Micro scores & S…

985d8bd

…cores per label

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhanced Evaluation Metrics with Macro/Micro Scores and Per-Label Analysis#304

Enhanced Evaluation Metrics with Macro/Micro Scores and Per-Label Analysis#304
Xiaomin-HUANG wants to merge 1 commit into
urchade:mainfrom
Xiaomin-HUANG:feature/evaluation

Xiaomin-HUANG commented Nov 13, 2025

Labels

1 participant

Conversation

Xiaomin-HUANG commented Nov 13, 2025

Labels

1 participant