Modify this code for better accuracy #40

1033020837 · 2020-02-26T17:10:13Z

According to this document:https://huggingface.co/transformers/model_doc/bert.html:
the pooler_output of bert_model is usually not a good summary of the semantic content of the input, we should better with averaging or pooling the sequence of hidden-states for the whole input sequence.

So modify the BertForMultiLabel.py to:
avg_output = torch.mean(outputs[0],1).view(-1,self.config.hidden_size)
logits = self.classifier(avg_output)

You could get better performance through this modification.

lonePatient · 2020-02-27T01:20:30Z

@1033020837 Thanks a lot, I will try.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modify this code for better accuracy #40

Modify this code for better accuracy #40

1033020837 commented Feb 26, 2020

lonePatient commented Feb 27, 2020

Modify this code for better accuracy #40

Modify this code for better accuracy #40

Comments

1033020837 commented Feb 26, 2020

lonePatient commented Feb 27, 2020