Learning Sparse Causal Gaussian Networks With Experimental Intervention: Regularization and Coordinate Descent

Fu, Fei; Zhou, Qing

doi:10.1080/01621459.2012.754359

Cited by 70 publications

(130 citation statements)

References 32 publications

Supporting

Mentioning

129

Contrasting


“…Zou (2006) showed that the adaptive lasso satisfies the consistency in model selection if $\tilde{A}_{ij}$ is a $\sqrt{n}$-consistent estimate of $A_{ij}$ and suggested using the ordinary least squares (OLS) estimate for $\tilde{A}_{ij}$. Fu and Zhou (2013) proposed the ordinary least squares (OLS) estimate with an upper bound, which is $\min\left(\frac{1}{|\tilde{A}_{ij}|^{\gamma}}, \frac{1}{(N^{-1})^{\gamma}}\right)$ with $N = 10^4$. However, if there are correlations among variables, the estimates from OLS are unstable.…”

Section: Problem Formulation (mentioning)

confidence: 99%
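A minimal sketch of the capped weight described in the statement above, assuming the expression is the minimum of the two terms so that each weight is bounded above by $N^{\gamma}$. The function name `capped_adaptive_weights` and the nested-list layout are illustrative choices, not taken from either paper; the defaults $\gamma = 0.15$ and $N = 10^4$ are the values quoted in the statements on this page.

```python
def capped_adaptive_weights(a_init, gamma=0.15, n_cap=1e4):
    """Return w_ij = min(|a_ij|^(-gamma), n_cap^gamma) for each initial estimate.

    a_init is a list of rows of initial coefficient estimates (hypothetical
    layout). gamma and n_cap default to the values quoted in the citation
    statements (gamma = 0.15, N = 10^4).
    """
    cap = n_cap ** gamma  # upper bound N^gamma on every weight
    weights = []
    for row in a_init:
        w_row = []
        for a in row:
            if a == 0:
                # |a|^(-gamma) is infinite at zero, so the cap applies directly.
                w_row.append(cap)
            else:
                w_row.append(min(abs(a) ** (-gamma), cap))
        weights.append(w_row)
    return weights
```

With these defaults the cap is $10^{0.6}$, roughly 3.98, so initial estimates that are zero or very close to zero receive that weight instead of an unbounded one.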

“…The formula (8) provides the lower and upper bound of 1 and $N^{\gamma}$, respectively. In our simulation study, we use $N = 10^4$ as Fu and Zhou (2013) did. We construct the initial estimates $\tilde{A}_{ij}$ from the regular lasso estimates by minimizing function (8) with a certain $\lambda_0$, $\gamma$, and $w_{ij} = 1$.…”

Section: Problem Formulation (mentioning)

confidence: 99%

“…We compared the NS-DIST method with the following five recent DAG estimation methods: the PC-stable method provided by Colombo and Maathuis (2013), the MMHC method provided by Tsamardinos et al (2006), the GES method provided by Chickering (2002b), the CD algorithm provided by Fu and Zhou (2013), and a permutation approach with the Lasso framework provided by Shojaie and Michailidis (2010). As proposed by Fu and Zhou (2013), we use γ = 0.15 for the CD method.…”

Section: Simulation Study (mentioning)

confidence: 99%

“…In addition, Zou (2006) proposed the adaptive lasso regression, which gives asymptotic consistency in variable selection. Fu and Zhou (2013) proposed a profiled likelihood with an adaptive lasso penalty to estimate DAGs under unknown variable order based on experimental data with interventions, and used a blockwise coordinate descent (CD) algorithm to find a local optimal solution. Aragam and Zhou (2015) improved the CD algorithm under the observational data.…”

Section: Introduction (mentioning)

confidence: 99%
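To make the coordinate descent idea in the statement above concrete, here is a sketch for an ordinary weighted (adaptive) lasso regression. It is deliberately not the blockwise algorithm of Fu and Zhou (2013), which works on a DAG likelihood under an acyclicity constraint; it only illustrates the soft-thresholding update that lasso-type coordinate descent relies on. The objective assumed here is $\frac{1}{2}\lVert y - X\beta\rVert^2 + \lambda \sum_j w_j |\beta_j|$, and all function and parameter names are hypothetical.

```python
def soft_threshold(z, t):
    """Shrink z toward zero by t; return 0.0 when |z| <= t."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def weighted_lasso_cd(X, y, lam, weights, n_sweeps=100):
    """Cyclic coordinate descent for 0.5*||y - X b||^2 + lam * sum_j w_j*|b_j|.

    X is a list of rows, y a list of responses, weights the per-coefficient
    penalty weights. This is a plain regression sketch, not a DAG learner.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X @ beta, with beta starting at zero
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            if col_sq[j] == 0:
                continue  # an all-zero column carries no information
            # Inner product of column j with the partial residual that
            # excludes coefficient j's own contribution.
            rho = sum(X[i][j] * resid[i] for i in range(n)) + col_sq[j] * beta[j]
            new_beta = soft_threshold(rho, lam * weights[j]) / col_sq[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta
```

On a 2-by-2 identity design with `y = [3.0, 0.5]`, `lam = 1.0` and unit weights, each coefficient is simply the soft-thresholded response, so the result is `[2.0, 0.0]`. Lowering the second weight to 0.1 lets the second coefficient survive, which is the effect an adaptive weight is meant to have.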


Estimation of Directed Acyclic Graphs Through Two-Stage Adaptive Lasso for Gene Network Inference

Han, Chen, Cheon, et al. 2016

Journal of the American Statistical Association


Graphical models are a popular approach to find dependence and conditional independence relationships between gene expressions. Directed acyclic graphs (DAGs) are a special class of directed graphical models, where all the edges are directed edges and contain no directed cycles. The DAGs are well known models for discovering causal relationships between genes in gene regulatory networks. However, estimating DAGs without assuming known ordering is challenging due to high dimensionality, the acyclic constraints, and the presence of equivalence class from observational data. To overcome these challenges, we propose a two-stage adaptive Lasso approach, called NS-DIST, which performs neighborhood selection (NS) in stage 1, and then estimates DAGs by the Discrete Improving Search with Tabu (DIST) algorithm within the selected neighborhood. Simulation studies are presented to demonstrate the effectiveness of the method and its computational efficiency. Two real data examples are used to demonstrate the practical usage of our method for gene regulatory network inference.


“…However, it tends to yield a large number of false positives in the sparse network problem, as pointed out by Fu and Zhou in their seminal paper (Fu & Zhou, 2013). Fu and Zhou proposed an "elbow method" that outperforms the cross-validation method, where the optimal tuning parameter corresponds to the change point at which an increase of λ does not yield a substantial decrease of log-likelihood.…”

Section: Cox Proportional Hazard Model With Sparse Group Lasso Penalty (mentioning)

confidence: 99%
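One simple reading of the "elbow" idea in the statement above can be sketched as follows: walk along the tuning path in order of increasing λ and stop just before the first step at which the log-likelihood drops sharply. This is an illustrative heuristic only, not the exact criterion of Fu and Zhou (2013); the function name `elbow_lambda` and the threshold `frac` are assumptions introduced here.

```python
def elbow_lambda(lambdas, loglik, frac=0.1):
    """Pick the largest lambda reached before the log-likelihood drops sharply.

    lambdas must be sorted in increasing order, with loglik[i] the fitted
    log-likelihood at lambdas[i]. frac is a hypothetical threshold: a step
    counts as "substantial" when it loses more than frac of the total range
    of the log-likelihood along the path.
    """
    if len(lambdas) != len(loglik) or len(lambdas) < 2:
        raise ValueError("need at least two (lambda, log-likelihood) pairs")
    total = max(loglik) - min(loglik)
    if total == 0:
        return lambdas[-1]  # flat curve: the sparsest fit costs nothing
    for i in range(len(lambdas) - 1):
        drop = loglik[i] - loglik[i + 1]
        if drop > frac * total:
            return lambdas[i]  # the next increase in lambda costs too much fit
    return lambdas[-1]
```

Unlike cross-validation, a rule of this kind needs only the fitted log-likelihood along the path. The choice of `frac` is arbitrary in this sketch and would have to be tuned for a real problem.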

Identification of Biomarkers for Predicting the Overall Survival of Ovarian Cancer Patients: a Sparse Group Lasso Approach

Mai, Zhang 2016

IJSP


Next-generation sequencing has been routinely applied to cancer biology, making it possible for researchers to elucidate the molecular mechanisms underlying cancer initiation and progression. However, how to identify oncomarkers from massive complex genomic data poses a great challenge for both modeling and computing. In this paper, we propose a novel computational pipeline to identify genes related to the overall survival of ovarian cancer patients from the rich Cancer Genome Atlas data. Different from the existing studies, we incorporate dependence structure among genes and pathway information into the variable selection. Firstly, the dimensionality of the ovarian cancer data is reduced by a novel stepwise feature screening which mimics the hierarchy of the underlying causal network. The second step of the pipeline is to divide genes into clusters with distinct cellular functions by k-means, x-means and PAMSAM learning algorithms. In the final step, we fit a Cox proportional hazard model with a sparse group lasso penalty for further variable selection. Of the 115 genes in the final list, many were reported to be associated with cancer initiation or progression in the literature. In addition, we find several gene families including the NEK family and RNF family, which are closely associated with the survival of ovarian cancer patients.


Recent Development in Methodology for Gene Network Problems and Inferences

Han, Zhong, Yang, et al. 2016

Healthcare Analytics: From Data to Knowledge to Healthcare Improvement
