Commit 12ce596

Merge pull request #199 from IINemo/monotonic_calibration
Monotonic calibration
2 parents 6fda7e3 + 4b58a9c commit 12ce596

15 files changed: +1249 -272 lines changed

.gitignore

Lines changed: 0 additions & 1 deletion
@@ -2,7 +2,6 @@
 src/lm_polygraph/app/node_modules
 **/.DStore
 workdir/
-**/normalization/*
 
 *.npy
 *.csv

Lines changed: 58 additions & 0 deletions
@@ -0,0 +1,58 @@
+hydra:
+  run:
+    dir: ${cache_path}/${now:%Y-%m-%d}/${now:%H-%M-%S}
+
+cache_path: ./workdir/output/normalization
+save_path: '${hydra:run.dir}'
+
+normalization_methods:
+  - min_max
+  - quantile
+  - binned_pcc
+  - isotonic_pcc
+
+num_bins: 100
+
+man_paths:
+  - polygraph_tacl_stablelm12b_coqa_train.man
+  - polygraph_tacl_stablelm12b_gsm8k_train.man
+  - polygraph_tacl_stablelm12b_mmlu_train.man
+  - polygraph_tacl_stablelm12b_triviaqa_train.man
+  - polygraph_tacl_stablelm12b_wmt14_train.man
+  - polygraph_tacl_stablelm12b_wmt19_train.man
+  - polygraph_tacl_stablelm12b_xsum_train.man
+
+ue_method_names:
+  - MaximumSequenceProbability
+  - Perplexity
+  - MeanTokenEntropy
+  - MeanPointwiseMutualInformation
+  - MeanConditionalPointwiseMutualInformation
+  - CCP
+  - PTrue
+  - PTrueSampling
+  - MonteCarloSequenceEntropy
+  - MonteCarloNormalizedSequenceEntropy
+  - LexicalSimilarity_rouge1
+  - LexicalSimilarity_rouge2
+  - LexicalSimilarity_rougeL
+  - LexicalSimilarity_BLEU
+  - NumSemSets
+  - EigValLaplacian_NLI_score_entail
+  - EigValLaplacian_NLI_score_contra
+  - EigValLaplacian_Jaccard_score
+  - DegMat_NLI_score_entail
+  - DegMat_NLI_score_contra
+  - DegMat_Jaccard_score
+  - Eccentricity_NLI_score_entail
+  - Eccentricity_NLI_score_contra
+  - Eccentricity_Jaccard_score
+  - SemanticEntropy
+  - SAR
+  - TokenSAR
+  - SentenceSAR
+  - RenyiNeg
+  - FisherRao
+
+gen_metric_names:
+  - AlignScore
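
For orientation, the four `normalization_methods` names and `num_bins` in the config above can be read as follows. This is a rough, standard-library-only sketch and not the code this commit adds: every function name is hypothetical, the reading of `binned_pcc` and `isotonic_pcc` as "map an uncertainty score to the average generation quality observed for similar scores" is an assumption drawn from the config key names and the PR title, and the non-increasing direction of the isotonic fit (higher uncertainty, lower quality) is likewise assumed.

```python
from bisect import bisect_left, bisect_right
from statistics import mean


def min_max_normalize(calib_scores, score):
    """Rescale `score` to [0, 1] using the range seen in calibration data."""
    lo, hi = min(calib_scores), max(calib_scores)
    if hi == lo:
        return 0.0
    return min(1.0, max(0.0, (score - lo) / (hi - lo)))


def quantile_normalize(calib_scores, score):
    """Fraction of calibration scores that are <= `score` (empirical CDF)."""
    ordered = sorted(calib_scores)
    return bisect_right(ordered, score) / len(ordered)


def fit_binned(calib_scores, calib_quality, num_bins):
    """Equal-mass bins over sorted scores; keep each bin's upper edge and mean quality."""
    pairs = sorted(zip(calib_scores, calib_quality))
    n = len(pairs)
    num_bins = min(num_bins, n)  # guarantees every bin is non-empty
    edges, means = [], []
    for b in range(num_bins):
        chunk = pairs[b * n // num_bins:(b + 1) * n // num_bins]
        edges.append(chunk[-1][0])
        means.append(mean(q for _, q in chunk))
    return edges, means


def fit_isotonic_decreasing(calib_scores, calib_quality):
    """Pool-adjacent-violators fit of a non-increasing step function (assumed direction)."""
    blocks = []  # each block: [sum of quality, count, largest score in block]
    for s, q in sorted(zip(calib_scores, calib_quality)):
        blocks.append([q, 1, s])
        # Merge while an earlier block has a lower mean than the one after it.
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] < blocks[-1][0] / blocks[-1][1]
        ):
            q_sum, count, last_score = blocks.pop()
            blocks[-1][0] += q_sum
            blocks[-1][1] += count
            blocks[-1][2] = last_score
    edges = [b[2] for b in blocks]
    means = [b[0] / b[1] for b in blocks]
    return edges, means


def apply_steps(edges, means, score):
    """Look up the step value for `score`; scores past the last edge use the last step."""
    idx = min(bisect_left(edges, score), len(means) - 1)
    return means[idx]
```

Under these assumptions, `fit_binned(ue_scores, quality, num_bins=100)` would correspond to the `num_bins: 100` setting, with `ue_scores` coming from one of the `ue_method_names` estimators and `quality` from the `AlignScore` metric. Both fitted variants return step functions, so the same `apply_steps` lookup serves either one.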
