jysh1023 committed
Commit 5bbc4ab · 1 parent: 8be076d

Training in progress, epoch 1

logs/events.out.tfevents.1700313495.0a848c80699a.8533.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0789b7d98277a9ee5d21fac6763b32e22a4a43aff0183db36f09da5dfcd2e06e
+ size 4715
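The three added lines above are a Git LFS pointer, not the TensorBoard event file itself: the repository stores only the object's `version`, `oid`, and `size`, and the binary lives in LFS storage. As an illustrative sketch (the helper name and parsing approach here are my own, not part of this repo or of git-lfs), the pointer's `key value` lines can be read like this:

```python
# Hypothetical sketch: parse a git-lfs v1 pointer (three "key value" lines,
# as in the diff above) into a dict keyed by field name.
def parse_lfs_pointer(text):
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>"; split on the first space only,
        # since the value (e.g. a URL) may itself contain no further structure.
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:0789b7d98277a9ee5d21fac6763b32e22a4a43aff0183db36f09da5dfcd2e06e
size 4715"""

info = parse_lfs_pointer(pointer)
print(info["size"])  # prints 4715 (as a string; the spec stores it as text)
```

The `oid` value carries its own hash algorithm prefix (`sha256:`), so a consumer can check which digest to compute before fetching the object.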
nncf_output.log CHANGED
@@ -45,53 +45,6 @@ INFO:nncf:Collecting tensor statistics |█████           | 1 / 3
  INFO:nncf:Collecting tensor statistics |██████████      | 2 / 3
  INFO:nncf:Collecting tensor statistics |████████████████| 3 / 3
  INFO:nncf:Compiling and loading torch extension: quantized_functions_cuda...
- INFO:nncf:Not adding activation input quantizer for operation: 7 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[position_embeddings]/embedding_0
- INFO:nncf:Not adding activation input quantizer for operation: 4 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[word_embeddings]/embedding_0
- INFO:nncf:Not adding activation input quantizer for operation: 5 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[token_type_embeddings]/embedding_0
- INFO:nncf:Not adding activation input quantizer for operation: 6 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 8 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/__iadd___0
- INFO:nncf:Not adding activation input quantizer for operation: 9 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 10 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/Dropout[dropout]/dropout_0
- INFO:nncf:Not adding activation input quantizer for operation: 23 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfAttention[self]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 26 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
- INFO:nncf:Not adding activation input quantizer for operation: 32 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 33 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 37 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 38 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 51 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfAttention[self]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 54 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
- INFO:nncf:Not adding activation input quantizer for operation: 60 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 61 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 65 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 66 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 79 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfAttention[self]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 82 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
- INFO:nncf:Not adding activation input quantizer for operation: 88 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 89 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 93 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 94 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 107 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfAttention[self]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 110 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
- INFO:nncf:Not adding activation input quantizer for operation: 116 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 117 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 121 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 122 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 135 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfAttention[self]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 138 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
- INFO:nncf:Not adding activation input quantizer for operation: 144 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 145 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 149 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 150 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 163 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfAttention[self]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 166 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
- INFO:nncf:Not adding activation input quantizer for operation: 172 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 173 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Not adding activation input quantizer for operation: 177 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertOutput[output]/__add___0
- INFO:nncf:Not adding activation input quantizer for operation: 178 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
- INFO:nncf:Collecting tensor statistics |█████           | 1 / 3
- INFO:nncf:Collecting tensor statistics |██████████      | 2 / 3
- INFO:nncf:Collecting tensor statistics |████████████████| 3 / 3
- INFO:nncf:Compiling and loading torch extension: quantized_functions_cuda...
  INFO:nncf:Finished loading torch extension: quantized_functions_cuda
  WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
  NNCF relies on custom-wrapping the `forward` call in order to function properly.
@@ -170,19 +123,48 @@ model.nncf.set_original_unbound_forward(fn)
  if `fn` has an unbound 0-th `self` argument, or
  with model.nncf.temporary_bound_original_forward(fn): ...
  if `fn` already had 0-th `self` argument bound or never had it in the first place.
- WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
- NNCF relies on custom-wrapping the `forward` call in order to function properly.
- Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
- If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
- model.nncf.set_original_unbound_forward(fn)
- if `fn` has an unbound 0-th `self` argument, or
- with model.nncf.temporary_bound_original_forward(fn): ...
- if `fn` already had 0-th `self` argument bound or never had it in the first place.
- WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
- NNCF relies on custom-wrapping the `forward` call in order to function properly.
- Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
- If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
- model.nncf.set_original_unbound_forward(fn)
- if `fn` has an unbound 0-th `self` argument, or
- with model.nncf.temporary_bound_original_forward(fn): ...
- if `fn` already had 0-th `self` argument bound or never had it in the first place.
+ INFO:nncf:Statistics of the quantization algorithm:
+ Epoch 1 |+--------------------------------+-------+
+ Epoch 1 || Statistic's name               | Value |
+ Epoch 1 |+================================+=======+
+ Epoch 1 || Ratio of enabled quantizations | 100   |
+ Epoch 1 |+--------------------------------+-------+
+ Epoch 1 |
+ Epoch 1 |Statistics of the quantization share:
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Statistic's name                 | Value              |
+ Epoch 1 |+==================================+====================+
+ Epoch 1 || Symmetric WQs / All placed WQs   | 100.00 % (38 / 38) |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Asymmetric WQs / All placed WQs  | 0.00 % (0 / 38)    |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Signed WQs / All placed WQs      | 100.00 % (38 / 38) |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Unsigned WQs / All placed WQs    | 0.00 % (0 / 38)    |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Per-tensor WQs / All placed WQs  | 0.00 % (0 / 38)    |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Per-channel WQs / All placed WQs | 100.00 % (38 / 38) |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Placed WQs / Potential WQs       | 70.37 % (38 / 54)  |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Symmetric AQs / All placed AQs   | 27.27 % (12 / 44)  |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Asymmetric AQs / All placed AQs  | 72.73 % (32 / 44)  |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Signed AQs / All placed AQs      | 100.00 % (44 / 44) |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Unsigned AQs / All placed AQs    | 0.00 % (0 / 44)    |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Per-tensor AQs / All placed AQs  | 100.00 % (44 / 44) |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 || Per-channel AQs / All placed AQs | 0.00 % (0 / 44)    |
+ Epoch 1 |+----------------------------------+--------------------+
+ Epoch 1 |
+ Epoch 1 |Statistics of the bitwidth distribution:
+ Epoch 1 |+--------------+---------------------+--------------------+--------------------+
+ Epoch 1 || Num bits (N) | N-bits WQs / Placed | N-bits AQs /       | N-bits Qs / Placed |
+ Epoch 1 ||              | WQs                 | Placed AQs         | Qs                 |
+ Epoch 1 |+==============+=====================+====================+====================+
+ Epoch 1 || 8            | 100.00 % (38 / 38)  | 100.00 % (44 / 44) | 100.00 % (82 / 82) |
+ Epoch 1 |+--------------+---------------------+--------------------+--------------------+
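The percentage columns in the added quantization-share table follow directly from the raw quantizer counts NNCF reports (38 weight quantizers placed out of 54 potential; 12 symmetric and 32 asymmetric activation quantizers out of 44). A quick sketch reproducing them (the `share` helper is made up for illustration):

```python
# Recompute the share percentages shown in the log table above from the
# raw counts, rounding to two decimals as the table does.
def share(part, total):
    return round(100.0 * part / total, 2)

print(share(38, 54))  # 70.37 -> "Placed WQs / Potential WQs"
print(share(12, 44))  # 27.27 -> "Symmetric AQs / All placed AQs"
print(share(32, 44))  # 72.73 -> "Asymmetric AQs / All placed AQs"
```

The bitwidth table is consistent with these counts as well: all 38 weight quantizers plus all 44 activation quantizers are 8-bit, giving the 82 / 82 total.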
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b6286ab040d463864d3a8e9d171df02ea044a878ecd16646b9aef131ecbea578
+ oid sha256:944a5d2394f6acb552e004deceb897c4f1955b3c98a0e7d7582e2f51dbd1e47c
  size 268172814
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:77fea7f47d6f91af7370043dfc2a8acdf21fdd576a23651c0002ed739a6042b2
+ oid sha256:93c59411257f8aad451b10112f68fc1a68f69691452b7210ba820b2e2b131f38
  size 4600
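In both binary diffs above only the `oid` changes while `size` stays constant, so the epoch-1 checkpoint replaced the file contents in LFS. A downloaded object can be checked against its pointer; this is an illustrative sketch (function name and stand-in payload are my own, since the real 268 MB blob cannot be embedded here), but the check mirrors what git-lfs does:

```python
import hashlib

# Sketch: verify fetched bytes against an LFS pointer's oid and size.
def verify_lfs_object(data: bytes, oid: str, size: int) -> bool:
    algo, _, digest = oid.partition(":")   # oid is "<algorithm>:<hex digest>"
    if algo != "sha256":
        raise ValueError("unexpected hash algorithm: " + algo)
    return len(data) == size and hashlib.sha256(data).hexdigest() == digest

# Small stand-in payload instead of the real checkpoint bytes.
payload = b"example payload"
oid = "sha256:" + hashlib.sha256(payload).hexdigest()
print(verify_lfs_object(payload, oid, len(payload)))  # True
```

Checking the length first is cheap and catches truncated downloads before hashing.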