2020
DOI: 10.48550/arxiv.2012.04115
Preprint

Generalization bounds for deep learning

Guillermo Valle-Pérez,
Ard A. Louis

Abstract: Generalization in deep learning has been the topic of much recent theoretical and empirical research. Here we introduce desiderata for techniques that predict generalization errors for deep learning models in supervised learning. Such predictions should 1) scale correctly with data complexity; 2) scale correctly with training set size; 3) capture differences between architectures; 4) capture differences between optimization algorithms; 5) be quantitatively not too far from the true error (in particular, be non…

Cited by 4 publications (4 citation statements, published in 2021 and 2024) · References 52 publications

“…For example, the complexity of the hypothesis space bound increases as the number of parameters becomes larger and the stability-based bound grows with respect to the iterations of optimization. To handle the problems and further improve the generalization bound of GNNs, researchers can leverage recent advances in deep learning theory such as local Rademacher complexity [178], marginal-likelihood PAC-Bayes [179] and H-consistency [180]. Besides, it is observed that existing generalization bounds often heavily rely on the number of nodes and maximum node degree as the graph-related term in their final expressions, which is too coarse-grained to capture the complex graph structure information.…”
Section: Discussion
confidence: 99%
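The two failure modes this excerpt points to can be made concrete with generic forms of the bounds. The following is an illustrative sketch only, not a quotation from the cited works, and the symbols are introduced here purely for exposition. A hypothesis-space (VC- or Rademacher-type) bound typically has the shape

    L(f) \le \hat{L}(f) + O\!\left(\sqrt{\frac{\mathcal{C}(\mathcal{H})}{m}}\right),

where the capacity term \mathcal{C}(\mathcal{H}) grows with the number of parameters, while a uniform-stability bound typically has the shape

    \mathbb{E}\big[L(A(S)) - \hat{L}(A(S))\big] \le O(\beta_T),

where the stability coefficient \beta_T of the training algorithm A grows with the number of optimization iterations T. Both forms therefore loosen precisely in the over-parameterized, long-training regime the citing authors describe.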
“…A comprehensive review by Valle-Pérez and Louis (2020) shows that, most types of generalization bounds are characterized by the norm of network weights or other related quantities such as the Lipschitz constant of the network; some examples include the margin-based bounds (Bartlett et al, 2017), the sensitivity-based bounds (Neyshabur et al, 2017), the NTK-based bounds (Cao and Gu, 2019) or the compression-based bounds (Li et al, 2020). They are model-agnostic in the sense that, only a generic functional form of the network is required and the weights are then implicitly regularized by some learning algorithms.…”
Section: Comparison With Other Existing Work
confidence: 99%
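As a rough illustration of the norm-based bounds this excerpt groups together (a schematic sketch in the spirit of spectrally-normalized margin bounds such as Bartlett et al., 2017; the symbols and the omitted correction factor are simplifications, not the exact published statement):

    L_0(f) \;\le\; \hat{L}_\gamma(f) \;+\; \tilde{O}\!\left(\frac{1}{\gamma\sqrt{m}}\,\prod_{i=1}^{d}\|W_i\|_\sigma \cdot R_W\right),

where \hat{L}_\gamma is the empirical margin loss at margin \gamma, m the training-set size, \|W_i\|_\sigma the spectral norm of layer i, and R_W a norm-based correction term. For 1-Lipschitz activations the product of spectral norms also upper-bounds the network's Lipschitz constant, which is why Lipschitz-based and norm-based bounds are treated together in the quoted passage.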
“…Generalization in reinforcement learning Generalizing a model's predictions across a variety of unseen, high-dimensional inputs has been extensively studied in the static supervised learning setting [Bartlett, 1998, Triantafillou et al., 2019, Valle-Pérez and Louis, 2020, Liu et al., 2020b]. Gener…”
Section: Related Work
confidence: 99%