| Authors | مهسا سهیل شمائی,سجاد فتحی هفشجانی |
| Journal | Axioms |
| Page number | 1 |
| Volume number | 13 |
| IF | 1.9 |
| Paper Type | Full Paper |
| Published At | 2024-12-06 |
| Journal Grade | Scientific - research |
| Journal Type | Electronic |
| Journal Country | Iran, Islamic Republic Of |
| Journal Index | JCR ,SCOPUS |
Abstract
This paper proposes a novel sine step size for warm-restart stochastic gradient descent (SGD). For the SGD based on the new proposed step size, we establish convergence rates for smooth non-convex functions with and without the Polyak–Łojasiewicz (PL) condition. To assess the effectiveness of the new step size, we implemented it across several datasets, including FashionMNIST, CIFAR10, and CIFAR100. This implementation was compared against eight distinct existing methods. The experimental results demonstrate that the proposed sine step size improves the test accuracy of the CIFAR100 dataset by 1.14%
. This improvement highlights the efficiency of the new step size when compared to eight other popular step size methods.