You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: PLAN.md
+6Lines changed: 6 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -64,6 +64,12 @@ DataMetaMap aims to compare datasets within a unified vector space to identify s
64
64
65
65
-**Evaluation Metrics Definition**
66
66
Define quantitative metrics to evaluate embedding quality and similarity measurement accuracy.
67
+
68
+
Description (done by Stepanov Ilya):
69
+
- Define cosine similarity, Euclidean distance and kernel-based distance as core metrics to evaluate geometric separability and structural relationships between dataset embeddings in the latent space
70
+
- Define Maximum Mean Discrepancy (MMD) metric as described in the original paper
71
+
- Ensure that all embedding methods and baselines will be evaluate using all metrics so that comparison across methods is consistent and reproducible
72
+
67
73
68
74
-**Planning and Specifications**
69
75
Define technical specifications and success criteria based on research findings and data availability.
0 commit comments