Real-Data-Beispiele in Biostatistik- und Bioinformatik-Fachzeitschriften: Ein Survey

Episódios

Generalized Additive Models with Unknown Link Function Including Variable Selection
21 mai 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
The generalized additive model is a well established and strong tool that allows to model smooth effects of predictors on the response. However, if the link function, which is typically chosen as the canonical link, is misspecified, substantial bias is to be expected. A procedure is proposed that
simultaneously estimates the form of the link function and the unknown form of the predictor functions including selection of predictors. The procedure is based on boosting methodology, which obtains estimates by using a sequence of weak learners. It strongly dominates fitting procedures that are unable to modify a given link function if the true link function deviates from the fixed function. The performance of the procedure is shown
in simulation studies and illustrated by a real world example.
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
On the Procrustean analogue of individual differences scaling (INDSCAL)
10 mai 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
In this paper, individual differences scaling (INDSCAL) is revisited, considering
INDSCAL as being embedded within a hierarchy of individual difference scaling
models. We explore the members of this family, distinguishing (i) models, (ii) the
role of identification and substantive constraints, (iii) criteria for fitting models and (iv) algorithms to optimise the criteria. Model formulations may be based either on data that are in the form of proximities or on configurational matrices. In its configurational version, individual difference scaling may be formulated as a form of generalized Procrustes analysis. Algorithms are introduced for fitting the new
models. An application from sensory evaluation illustrates the performance of the
methods and their solutions.
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Estão a faltar episódios?

Clique aqui para atualizar o feed.
Global permutation tests for multivariate ordinal data: alternatives, test statistics, and the null dilemma
17 abr 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
We discuss two-sample global permutation tests for sets of multivariate ordinal data in possibly high-dimensional setups, motivated by the analysis of data collected by means of the World Health Organisation's International Classification of Functioning,
Disability and Health. The tests do not require any modelling of the multivariate dependence structure. Specifically, we consider testing for marginal inhomogeneity and
direction-independent marginal order. Max-T test statistics are known to lead to good
power against alternatives with few strong individual effects. We propose test statistics that can be seen as their counterparts for alternatives with many weak individual effects. Permutation tests are valid only if the two multivariate distributions are identical under the null hypothesis. By means of simulations, we examine the practical impact of violations of this exchangeability condition. Our simulations suggest that theoretically invalid permutation tests can still be 'practically valid'. In particular, they suggest that the degree of the permutation procedure's failure may be considered as a function of the difference in group-specific covariance matrices, the proportion between group sizes, the number of variables in the set, the test statistic used, and the number of levels per variable.
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm
1 fev 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
In linear mixed models, the assumption of normally distributed random effects is often inappropriate and unnecessarily restrictive. The proposed approximate Dirichlet process mixture assumes a hierarchical Gaussian mixture that is based on the truncated version of the stick breaking presentation of the Dirichlet process. In addition to the weakening of distributional assumptions, the specification allows to identify clusters of observations with a similar random effects structure. An Expectation-Maximization algorithm is given that solves the estimation problem and that, in certain respects, may exhibit advantages over Markov chain Monte Carlo approaches when modelling with Dirichlet processes. The method is evaluated in a simulation study and applied to the dynamics of unemployment in Germany as well as lung function growth data.
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Variable selection with Random Forests for missing data
15 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
Variable selection has been suggested for Random Forests to improve their efficiency of data prediction and interpretation. However, its basic element, i.e. variable importance measures, can not be computed straightforward when there is missing data. Therefore an extensive simulation study has been conducted to explore possible solutions, i.e. multiple imputation, complete case analysis and a newly suggested importance measure for several missing data generating processes. The ability to distinguish relevant from non-relevant variables has been investigated for these procedures in combination with two popular variable selection methods. Findings and recommendations: Complete case analysis should not be applied as it lead to inaccurate variable selection and models with the worst prediction accuracy. Multiple imputation is a good means to select variables that would be of relevance in fully observed data. It produced the best prediction accuracy. By contrast, the application of the new importance measure causes a selection of variables that reflects the actual data situation, i.e. that takes the occurrence of missing values into account. It's error was only negligible worse compared to imputation.
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Modellierung der Heterogenität in Bradley-Terry-Luce Modellen
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Vergleich mehrerer Verfahren für multiples Testen bei der Analyse volatiler organischer Komponenten verschiedener Bakterien und Pilze zur Erregerdifferenzierung
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Parallel Boosting
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Was sind die Risikofaktoren für Rehe, vom Luchs gerissen zu werden?
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Affiliate Marketing: Analyse zeitlicher Aspekte im Online-Shopping
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Spezifikation der Linkfunktionen in diskreten Verweildauermodellen
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Boosting-Techniken zur Modellierung itemmodifizierender Effekte in Item- Response-Modellen
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Real-Data-Beispiele in Biostatistik- und Bioinformatik-Fachzeitschriften: Ein Survey
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Clusteranalyse zur Gruppierung von Items: Strategien zur Auffindung von Faktorstrukturen
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Statistical methods for comparison of two inaccurate measurement procedures in experiments with measurement replications
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Regularisierte Schätzverfahren für Bradley-Terry-Luce Modelle
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Clustering bilingual text corpora using mixtures of von Mises-Fisher distributions
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Modelling Comparison Data with Ordinal Response
1 jan 2013· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
A Technical Note on the Dirichlet-Multinomial Model
4 out 2012· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
This short note contains an explicit proof of the Dirichlet distribution being the conjugate prior to the Multinomial sample distribution as resulting from the general construction method described, e.g., in Bernardo and Smith (2000). The well-known Dirichlet-Multinomial model is thus shown to fit into the framework of canonical conjugate analysis (Bernardo and Smith 2000, Prop.~5.6, p.~273), where the update step for the prior parameters to their posterior counterparts has an especially simple structure. This structure is used, e.g., in the Imprecise Dirichlet Model (IDM) by Walley (1996), a simple yet powerful model for imprecise Bayesian inference using sets of Dirichlet priors to model vague prior knowledge, and furthermore in other imprecise probability models for inference in exponential families where sets of priors are considered.
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Stability of impurities with Coulomb potential in graphene with homogeneous magnetic field
9 jul 2012· Mathematik, Informatik und Statistik - Open Access LMU - Teil 02/03
- Ouvir Ouvir novamente Continuar A reproduzir…
- Ouvir depois Ouvir depois
Mostrar mais

Episódios

Generalized Additive Models with Unknown Link Function Including Variable Selection

On the Procrustean analogue of individual differences scaling (INDSCAL)

Global permutation tests for multivariate ordinal data: alternatives, test statistics, and the null dilemma

Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm

Variable selection with Random Forests for missing data

Modellierung der Heterogenität in Bradley-Terry-Luce Modellen

Vergleich mehrerer Verfahren für multiples Testen bei der Analyse volatiler organischer Komponenten verschiedener Bakterien und Pilze zur Erregerdifferenzierung

Parallel Boosting

Was sind die Risikofaktoren für Rehe, vom Luchs gerissen zu werden?

Affiliate Marketing: Analyse zeitlicher Aspekte im Online-Shopping

Spezifikation der Linkfunktionen in diskreten Verweildauermodellen

Boosting-Techniken zur Modellierung itemmodifizierender Effekte in Item- Response-Modellen

Clusteranalyse zur Gruppierung von Items: Strategien zur Auffindung von Faktorstrukturen

Statistical methods for comparison of two inaccurate measurement procedures in experiments with measurement replications

Regularisierte Schätzverfahren für Bradley-Terry-Luce Modelle

Clustering bilingual text corpora using mixtures of von Mises-Fisher distributions

Modelling Comparison Data with Ordinal Response

A Technical Note on the Dirichlet-Multinomial Model

Stability of impurities with Coulomb potential in graphene with homogeneous magnetic field