Conference Presentation (After Call) FZJ-2024-00647

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
SGD Biased towards Early Important Samples for Efficient Training

 ;  ;  ;

2023

International Conference on Data Mining, ICDM2023, ShanghaiShanghai, Peoples R China, 1 Dec 2023 - 4 Dec 20232023-12-012023-12-04 [10.34734/FZJ-2024-00647]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: In deep learning, using larger training datasets usually leads to more accurate models. However, simply adding more but redundant data may be inefficient, as some training samples may be more informative than others. We propose to bias SGD (Stochastic Gradient Descent) towards samples that are found to be more important after a few training epochs, by sampling them more often for the rest of training. In contrast to state-of-the-art, our approach requires less computational overhead to estimate sample importance, as it computes estimates once during training using the prediction probabilities, and does not require that training be restarted. In the experimental evaluation, we see that our learning technique trains faster than state-of-the-art and can achieve higher test accuracy, especially when datasets are not well balanced. Lastly, results suggest that our approach has intrinsic balancing properties. Code is available at https://github.com/AlessioQuercia/sgd biased.


Contributing Institute(s):
  1. Datenanalyse und Maschinenlernen (IAS-8)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  2. HDS LEE - Helmholtz School for Data Science in Life, Earth and Energy (HDS LEE) (HDS-LEE-20190612) (HDS-LEE-20190612)

Appears in the scientific report 2023
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Conference Presentations
Institute Collections > IAS > IAS-8
Workflow collections > Public records
Publications database
Open Access

 Record created 2024-01-16, last modified 2024-02-26


OpenAccess:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)