Broomva committed on
Commit fba9bbe
1 parent: d3194ee

End of training

Files changed (1)
  1. README.md +7 -7
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [t5-large](https://huggingface.co/t5-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0832
-- Bleu: 0.952
-- Gen Len: 17.9397
+- Loss: nan
+- Bleu: 0.6661
+- Gen Len: 17.2141
 
 ## Model description
 
@@ -45,15 +45,15 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 10
-- num_epochs: 3
+- num_epochs: 15
+- mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
-| 1.3886 | 1.0 | 12889 | 1.2435 | 0.7737 | 17.9241 |
-| 1.3043 | 2.0 | 25778 | 1.1197 | 0.9071 | 17.9235 |
-| 1.2024 | 3.0 | 38667 | 1.0832 | 0.952 | 17.9397 |
+| 0.0 | 1.0 | 7668 | nan | 0.6661 | 17.2141 |
+| 0.0 | 2.0 | 15336 | nan | 0.6661 | 17.2141 |
 
 
 ### Framework versions
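
The rows added in this commit report a training loss of 0.0 and a validation loss of nan at both logged epochs, with identical Bleu and Gen Len values. The diff does not state a cause. As a minimal sketch, assuming the table rows are available as pipe-delimited text, a check like the following could flag such rows before a model card is published. The names `parse_rows`, `has_non_finite_loss`, and `COLUMNS` are hypothetical helpers for illustration, not part of any library.

```python
import math

# Rows copied from the "+" side of the training-results table in this commit.
ADDED_ROWS = """\
| 0.0 | 1.0 | 7668 | nan | 0.6661 | 17.2141 |
| 0.0 | 2.0 | 15336 | nan | 0.6661 | 17.2141 |
"""

# Column order follows the table header in the README.
COLUMNS = ("train_loss", "epoch", "step", "val_loss", "bleu", "gen_len")


def parse_rows(table: str) -> list[dict[str, float]]:
    """Parse pipe-delimited table rows into dicts keyed by COLUMNS."""
    rows = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        # float("nan") parses to a NaN value, so "nan" cells need no special case.
        rows.append(dict(zip(COLUMNS, map(float, cells))))
    return rows


def has_non_finite_loss(row: dict[str, float]) -> bool:
    """Return True if either loss in the row is NaN or infinite."""
    return not (math.isfinite(row["train_loss"]) and math.isfinite(row["val_loss"]))


rows = parse_rows(ADDED_ROWS)
flagged_steps = [int(row["step"]) for row in rows if has_non_finite_loss(row)]
print(flagged_steps)  # [7668, 15336]
```

Both added rows are flagged here because their validation loss is not finite. The check only detects the symptom; it says nothing about why the loss became nan.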