• Corpus ID: 230433762

A selective review on calibration information from similar studies based on parametric likelihood or empirical likelihood

  title={A selective review on calibration information from similar studies based on parametric likelihood or empirical likelihood},
  author={Jing Qin and Yukun Liu and Pengfei Li},
In multi-center clinical trials, due to various reasons, the individual-level data are strictly restricted to be assessed publicly. Instead, the summarized information is widely available from published results. With the advance of computational technology, it has become very common in data analyses to run on hundreds or thousands of machines simultaneous, with the data distributed across those machines and no longer available in a single central location. How to effectively assemble the… 



A unified approach for synthesizing population‐level covariate effect information in semiparametric estimation with survival data

A unified framework by employing the generalized method of moments to combine information from different sources for the analysis of survival data is presented and the proposed estimator can be shown to be consistent and asymptotically normal; moreover, it is more efficient than the maximum partial likelihood estimator.

Generalized integration model for improved statistical inference by leveraging external summary data

SummaryMeta-analysis has become a powerful tool for improving inference by gathering evidence from multiple sources. It pools summary-level data from different studies to improve estimation

On the relative efficiency of using summary statistics versus individual-level data in meta-analysis.

It is shown that, for all commonly used parametric and semiparametric models, there is no asymptotic efficiency gain by analyzing original data if the parameter of main interest has a common value across studies, the nuisance parameters have distinct values among studies, and the summary statistics are based on maximum likelihood.

Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-Level Information From External Big Data Sources

A set of very general constraints are identified that link internal and external models and these constraints are used to develop a framework for semiparametric maximum likelihood inference that allows the distribution of covariates to be estimated using either the internal sample or an external reference sample.

Using covariate-specific disease prevalence information to increase the power of case-control studies

A statistical procedure is developed that improves the efficiency of the logistic regression model for a case-control study by utilizing auxiliary information on covariate-specific disease prevalence via a series of unbiased estimating equations.

Generalized linear models incorporating population level information: an empirical‐likelihood‐based approach

This paper provides a two-step algorithm that produces parameter estimates using only unconstrained estimation and gives an application to demographic hazard modelling by combining panel survey data with birth registration data to estimate annual birth probabilities by parity.

Meta-analysis in clinical trials.

Heterogeneity-aware and communication-efficient distributed statistical inference

This paper proposes distributed algorithms which account for the heterogeneous distributions by allowing site-specific nuisance parameters and establishes the non-asymptotic risk bound of the proposed distributed estimator and its limiting distribution in the two-index asymptotic setting.

Empirical likelihood ratio confidence intervals for a single functional

SUMMARY The empirical distribution function based on a sample is well known to be the maximum likelihood estimate of the distribution from which the sample was taken. In this paper the likelihood

On asymptotically optimal confidence regions and tests for high-dimensional models

A general method for constructing confidence intervals and statistical tests for single or low-dimensional components of a large parameter vector in a high-dimensional model and develops the corresponding theory which includes a careful analysis for Gaussian, sub-Gaussian and bounded correlated designs.