DEPARTMENT OF MATHEMATICS
[ Search | Contact  ]
                                                                             Bayesian Methods Workshop & Distinguished Lectures 4/30-5/1
                                                                                                           Seminars from Previous Semesters
                                                                                                           Stat Consortium Lectures from 2007
                                                                                                           Stat Seminars from Fall 2007

Statistics Seminar, 2008-2009

FALL 2008 SEMINAR TALKS:

(Fall 2008, Seminar No. 1)

SPEAKER: Prof. Hosam M. Mahmoud
The George Washington University,
Washington, D.C. 20052, U.S.A.

TITLE: The Polya Process and Applications

TIME AND PLACE:  Thursday, September 18, 2008, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: We investigate the Polya process, which underlies an urn of white and blue balls growing in real time. A partial differential equation governs the evolution of the process. Some special cases are amenable to exact and asymptotic solution: they include the (forward or backward) diagonal processes, and the Ehrenfest process.

Applications of standard (discrete) urns and their analogue when embedded in real time include several classes of random trees that have applications in computer science, epidemiology and philology. We shall present some of these applications.




TIME AND PLACE:  Thursday, September 12, 2008, 3:30pm
            

NO Talk: AMSC celebration.



(Seminar No. 2)

SPEAKER: Anastasia Voulgaraki, M.Sc.
University of Maryland
College Park, MD 20742, U.S.A.

TITLE: Estimation of Death Rates in US States With Small Subpopulations

TIME AND PLACE:  Thursday, October 2, 2008, 3:30pm
            Room 1313, Math Bldg

ABSTRACT: The National Center for Health Statistics (NCHS) uses observed mortality data to publish race-gender specific life tables for individual states decennially. At ages over 85 years, the reliability of death rates based on these data is compromised to some extent by age misreporting. The eight-parameter Heligman-Pollard parametric model is then used to smooth the data and obtain estimates/extrapolation of mortality rates for advanced ages. In States with small sub-populations the observed mortality rates are often zero, particularly among young ages. The presence of zero death rates makes the fitting of the Heligman-Pollard model dificult and at times outright impossible. In addition, since death rates are reported on a log scale, zero mortality rates are problematic. To overcome observed zero death rates, appropriate probability models are used. Using these models, observed zero mortality rates are replaced by the corresponding expected values. This enables using logarithmic transformations, and the fitting of the Heligman-Pollard model to produce mortality estimates for ages 0-130 years.



(Seminar No. 3)

SPEAKER: Prof. Ali Arab
Georgetown University
Washington, D.C. 20057, U.S.A.

TITLE: Efficient Parameterization of PDE-Based Dynamics for Spatio-Temporal Processes

TIME AND PLACE:  Thursday, October 16, 2008, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: Spatio-temporal dynamical processes in the physical and environmental sciences are often described by partial differential equations (PDEs). The inherent complexity of such processes due to high- dimensionality and multiple scales of spatial and temporal variability is often intensified by characteristics such as sparsity of data, complicated boundaries and irregular geometrical spatial domains, among others. In addition, uncertainties in the appropriateness of any given PDE for a real-world process, as well as uncertainties in the parameters associated with the PDEs are typically present. These issues necessitate the incorporation of efficient parameterizations of spatio-temporal models that are capable of addressing such characteristics. A hierarchical Bayesian model characterized by the PDE-based dynamics for spatio-temporal processes based on their Galerkin finite element method (FEM) representations is developed and discussed. As an example, spatio-temporal models based on advection-diffusion processes are considered. Finally, an application of the hierarchical Bayesian modeling approach is presented which considers the analysis of tracking data obtained from DST (data storage devices) sensors to mimic the pre-spawning upstream migration process of the declining shovelnose sturgeon.



(Seminar No. 4)

SPEAKER: Prof. Sandra Cerrai
University of Maryland
College Park, MD 20742, U.S.A.

TITLE: A central limit theorem for some reaction-diffusion equations with fast oscillating perturbation

TIME AND PLACE:  Thursday, October 23, 2008, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: We study the normalized difference between the solution $u_\e$ of a reaction-diffusion equation in a bounded interval $[0,L]$ perturbed by a fast oscillating term, arising as the solution of a stochastic reaction-diffusion equation with a strong mixing behavior, and the solution $\bar{u}$ of the corresponding averaged equation. We assume the smoothness of the reaction coefficient and we prove that a central limit type theorem holds. Namely, we show that the normalized difference $(u_\e-\bar{u})/\sqrt{\e}$ converges weakly in $C([0,T];L^2(0,L))$ to the solution of the linearized equation where an extra Gaussian term appears.



(Seminar No. 5)

SPEAKER: Prof. Edward J. Wegman
George Mason University
Fairfax, VA 22030, U.S.A.

TITLE: Mixture Models for Document Clustering

TIME AND PLACE:  Thursday, October 30, 2008, 3:30pm
            Colloquium Room 3206, Math Build (not the usual room.)
Talk sponsored by Math Stat and the Stat Consortium. There will be a eception following the talk in the Math Lounge 3201.

ABSTRACT: Automatic clustering and classification of documents within corpora is a challenging task. Often, comparing word usage within the corpus, the so-called bag-of-words methodology, does this. The lexicon for a corpus can indeed be very large. For the example of 503 documents that we consider, there are more than 7000 distinct terms and more than 91,000 bigrams. This means that a term vector characterizing a document will be approximately 7000 dimensional. In this talk, we use an adaptation of normal mixture models with 7000 dimensional data to locate centroids of clusters. The algorithm works surprisingly well and is linear in all the size metrics.



(Seminar No. 6)

SPEAKER: Dr. Michail Sverchkov
BAE Systems IT and Bureau of Labor Statistics
Washington, DC 20212-0001, U.S.A.

TITLE: On Estimation of Response Probabilities when Missing Data are Not Missing at Random

TIME AND PLACE:  Thursday, November 6, 2008, 3:30pm
            Room 1313, Math Bldg

ABSTRACT: Most methods that deal with the estimation of response probabilities assume either explicitly or implicitly that the missing data are 'missing at random' (MAR). However, in many practical situations this assumption is not valid, since the probability to respond often depends directly on the outcome value. The case where the missing data are not MAR (NMAR) can be treated by postulating a parametric model for the distribution of the outcomes before non-response and a model for the response mechanism. The two models define a parametric model for the joint distribution of the outcomes and response indicators, and therefore the parameters of these models can be estimated by maximization of the likelihood corresponding to this distribution. Modeling the distribution of the outcomes before non-response, however, can be problematic since no data are available from this distribution.

In this talk we propose an alternative approach that allows to estimate the parameters of the response model without modelling the distribution of the outcomes before non-response. The approach utilizes relationships between the population, the sample and the sample complement distributions derived in Pfeffermann and Sverchkov (1999, 2003) and Sverchkov and Pfeffermann (2004).

Key words: sample distribution, complement-sample distribution, prediction under informative sampling or non-response, estimating equations, missing information principle, non-parametric estimation



(Seminar No. 7)

SPEAKER: Prof. Malay Ghosh
University of Florida
Gainesville, FL 32611-8545 , U.S.A.

TITLE: Bayesian Benchmarking in Small Area Estimation

TIME AND PLACE:  Thursday, November 13, 2008, 3:30pm
            Room 1313, Math Bldg

ABSTRACT: Abstract



( Seminar No. 8: Special Tuesday Seminar)

SPEAKER: Prof. Gauri S. Datta
University of Georgia
Athens, GA 30602, U.S.A.

TITLE: Estimation of Small Area Means under Measurement Error Models

TIME AND PLACE:  Tuesday, November 18, 2008, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: In recent years demand for reliable estimates for characteristics of small domains (small areas) has greatly increased worldwide due to growing use of such estimates in formulating policies and programs, allocating government funds, planning regional development, and marketing decisions at local level. However, due to cost and operational considerations, it is seldom possible to procure a large enough overall sample size to support direct estimates of adequate precision for all domains of interest. It is often necessary to employ indirect estimates for small areas that can increase the effective domain sample size by borrowing strength from related areas through linking models, using census and administrative data and other auxiliary data associated with the small areas. To this end, the nested error regression model for unit-level data and the Fay-Herriot model for the area-level data have been widely used in small area estimation. These models usually treat that the explanatory variables are measured without error. However, explanatory variables are often subject to measurement error. Both functional and structural measurement error models have been recently proposed by researchers in small area estimation to deal with this issue. In this talk, we consider both functional and structural measurement error models in discussing empirical Bayes (equivalently, empirical BLUP) estimation of small area means.



(Seminar No. 9)

SPEAKER: Dr. Gang Zheng
Office of Biostatistics Research, National Heart, Lung and Blood Institute
6701 Rockledge Drive, Bethesda, MD 20892-7913, U.S.A.

TITLE: On Robust Tests for Case-Control Genetic Association Studies

TIME AND PLACE:  Thursday, November 20, 2008, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: When testing association between a single marker and the disease using case-control samples, the data are presented in a 2x3 table. Pearson's Chi-square test (2 df) and the trend test (1 df) are commonly used. Usually one does not know which of them to choose. It depends on the unknown genetic model underlying the data. So one could either choose the maximum (MAX) of a family of trend tests over all possible genetic models (Davies, 1977, 1987) or take the smaller p-values (MIN2) of Pearson's test and the trend test (Wellcome Trust Case-Control Consortium, 2007).

We show that Pearson's test, the trend test and MAX are all trend tests with different types of scores: data-driven or prespecified and restricted or not restricted. The results provide insight into the properties that MAX is always more powerful than Pearson's test when the genetic model is restricted and that Pearson's test is more robust when the model is not restricted. For the MIN2 of WTCCC (2007), we show that its null distribution can be derived, so the p-value of MIN2 can be obtained. Simulation is used to compare the above four tests. We apply MIN2 to the result obtained by The SEARCH Collaborative Group (NEJM, August 21, 2008) who used MIN2 to detect a SNP in a genome-wide association study, but could not report the p-value for that SNP when MIN2 was used.

References:

1. Joo J, Kwak M, Ahn K and Zheng G. A robust genome-wide scan statistic of the Wellcome Trust Case-Control Consortium. Biometrics (to appear).
2. Zheng G, Joo J and Yang Y. Pearson's test, trend test, and MAX are all trend tests with different type of scores. Unpublished manuscript. See Slides.



(Seminar No. 10: This Seminar is on a Tuesday)

SPEAKER: Dr. Yair Goldberg
Hebrew University of Jerusalem,
Mt. Scopus, Jerusalem, Israel

TITLE: Manifold learning: The price of normalization

TIME AND PLACE:  Tuesday, November 25, 2008, 3:30pm
             Room 1313, Math Bldg (room number may change)

ABSTRACT: The problem of finding a compact representation for high-dimensional data is encountered in many areas of science and has motivated the development of various dimension-reducing algorithms. The Laplacian EigenMap dimension-reducing algorithm (Belkin & Niyogi, 2003) is widely used for its intuitive approach and computational simplicity, claims to reveal the underlying non-linear structure of high-dimensional data.

We present a general class of examples in which the Laplacian EigenMap fails to generate a reasonable reconstruction of the data given to it. We both prove our results analytically and show them empirically. This phenomenon is then explained with an analysis of the limit-case behavior of the Laplacian EigenMap algorithm both using asymptotics and the continuous Laplacian operator. We also discuss the relevance of these findings to the algorithms Locally Linear Embedding (Roweis and Saul, 2000), Local Tangent Space Alignment (Zhang and Zha, 2004), Hessian Eigenmap (Donoho and Grimes, 2004), and Diffusion Maps (Coifman and Lafon, 2006).



(Seminar No. 11: DISTINGUISHED STATISTICS CONSORTIUM LECTURE
This Seminar is on a Friday)

SPEAKER: Mitchell H. Gail, M.D., Ph.D.
Senior Investigator
Biostatistics Branch, Div. Cancer Epidemiology & Genetics, National Cancer Institute,
Rockville, MD, 20852, U.S.A.

TITLE: Absolute Risk: Clinical Applications and Controversies

DATE/TIME:  Friday, December 5, 2008, 3:15--5:00pm

PLACE: Engineering Building Lecture Hall EGR 1202             

Immediately following the talk there will be a formal 25-minute Discussion, with a Reception to follow that.

ABSTRACT: Absolute risk is the probability that a disease will develop in a defined age interval in a person with specific risk factors. Sometimes absolute risk is called "crude" risk to distinguish it from the cumulative "pure" risk that might arise in the absence of competing causes of mortality. After defining absolute risk, I shall present a model for absolute breast cancer risk and illustrate its clinical applications. I will also describe the kinds of data and approaches that are used to estimate models of absolute risk and two criteria, calibration and discriminatory accuracy, that are used to evaluate absolute risk models. In particular, I will address whether well calibrated models with limited discriminatory accuracy can be useful.

Dr. Mitchell Gail received an M.D. from Harvard Medical School in 1968 and a Ph.D. in statistics from George Washington University in 1977. He joined NCI in 1969, and served as chief of the Biostatistics Branch from 1994 to 2008. Dr. Gail is a Fellow and former President of the American Statistical Association, a Fellow of the American Association for the Advancement of Science, an elected member of the American Society for Clinical Investigation, and an elected member of the Institute of Medicine of the National Academy of Sciences. He has received the Spiegelman Gold Medal for Health Statistics, the Snedecor Award for applied statistical research, the Howard Temin Award for AIDS Research, the NIH Director's Award, and the PHS Distinguished Service Medal.

Discussant: Professor Bilal Ayyub
Department of Civil & Environmental Engineering, UMCP
College Park, MD, 20742, U.S.A.


Discussion, 4:15pm: Engineering perspectives on Risk


Professor Ayyub is a Professor of Civil and Environmental Engineering at the University of Maryland College Park and Director of the Center for Technology and Systems Management. He is a Fellow of the ASCE, ASME, and SNAME.



(Seminar No. 12)

SPEAKER: Dr. Janice Lent
Energy Information Administration
Washington, DC 20585, U.S.A.

TITLE: Some Properties of Price Index Formulas

TIME AND PLACE:  Thursday, December 11, 2008, 3:30pm
            Room 1313, Math Bldg

ABSTRACT: Price indexes are important statistics that move large amounts of money in the U.S. economy. In order to adjust monetary figures for inflation/deflation, we must develop methods of using sample data to estimate changes in the value of a currency. A vast array of target price index formulas are discussed in the economics literature. In this seminar, we will present some of the formulas that are widely used by government statistical agencies as targets for price index estimation. We will examine and compare some of the properties of these formulas, including underlying economic assumptions, ease of estimation, and sensitivity to extreme values.



(Seminar No. 13)

SPEAKER: Dr. Zhe Lin
Institute for Advanced Computer Studies, University of Maryland
College Park, MD 20742, U.S.A.

TITLE: Recognizing Actions by Shape-Motion Prototypes

TIME AND PLACE:  Thursday, February 12, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: In this talk, I will introduce our recent work on gesture or action recognition based on shape-motion prototypes. During training, a set of action prototypes are learned in a joint shape and motion space via k-means clustering; During testing, humans are tracked while a frame-to-prototype correspondence is established by nearest neighbor search, and then actions are recognized using dynamic prototype sequence matching. Similarity matrices used for sequence matching are efficiently obtained by look-up table indexing, which is an order of magnitude faster than brute-force computation of frame-to-frame distance. Our approach enables robust action matching in very challenging situations (such as moving cameras, dynamic backgrounds) and allows automatic alignment of action sequences by dynamic time warping. Experimental results demonstrate that our approach achieves over 91% recognition rate on a large gesture dataset containing 294 video clips of 14 different gestures, and 100% on the Weizmann action dataset.



(Seminar No. 14)

SPEAKER: Prof. Refik Soyer
George Washington University
Washington, DC 20052, U.S.A.

TITLE: Information Importance of Predictors

TIME AND PLACE:  Thursday, February 19, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: The importance of predictors is characterized by the extent to which their use reduces uncertainty about predicting the response variable, namely their information importance. Shannon entropy is used to operationalize the concept. For nonstochastic predictors, maximum entropy characterization of probability distributions provides measures of information importance. For stochastic predictors, the expected entropy difference gives measures of information importance, which are invariant under one-to-one transformations of the variables. Applications to various data types lead to familiar statistical quantities for various models, yet with the unified interpretation of uncertainty reduction. Bayesian inference procedures for the importance and relative importance of predictors are developed. Three examples show applications to normal regression, contingency table, and logit analyses.



(Seminar No. 15)

SPEAKER: Lior Noy
Harvard Medical School
Boston, MA 02115, U.S.A.

TITLE: Studying Eye Movements in Movement Imitation

TIME AND PLACE:  Thursday, February 26, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: People, animals and robots can learn new actions from observation. In order to do so, they need to transform the visual input to motor output. What is the nature of this transformation? What are the visual features that are extracted and used by the imitator? A possible route for answering these questions is to analyze imitatorsN!NG eye movements during imitation. We monitored eye movements of human subjects while they were watching simple, one-arm movements in two conditions. In the watch-only condition the observers were instructed only to watch the movements. In the imitate condition the observers were instructed to watch and then to imitate each movement. Gaze trajectories were compared between the two conditions. In addition, we compared the human behavior to the predications of the Itti-Koch saliency-map model [1]. To determine the similarity among gaze trajectories of different observers we developed a novel comparison method, based on semi-parametric statistics. We compared this method to the more standard usage of cross-correlation scores and show the advantages of this method, in particular its ability to state that two gaze trajectories are either different or similar in a statistically significant way. Our results indicate that: (1)Subjects fixate at both the joints and the end-effectors of the observed moving arms, in contrast to previous reports [2]. (2)The Itti-Koch saliency-map model does not fully account for the human gaze trajectories. (3)Eye movements in movement imitation are similar to each other in the watch-only versus the imitate conditions.

Joint work with: Benjamin Kedem & Ritaja Sur, University of Maryland, and Tamar Flash, Weizmann Institute of Science.

References

[1] L. Itti and C. Koch. A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Research, 40:1489-1506, 2000.
[2] M. J. Mataric and M. Pomplun. Fixation behavior in observation and imitation of human movement. Cognitive Brain Research, 7(2):191-202, 1998.



(Seminar No. 16)

SPEAKER: Prof. Yasmin H. Said (Bio)
George Mason University
Fairfax, Virginia 22030, U.S.A.

TITLE: Microsimulation of an Alcohol System

TIME AND PLACE:  Thursday, March 5, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: Users of alcohol are incorporated into a societal system, which for many purposes resembles an ecological system. An understanding of how this ecological alcohol system works provides an opportunity to evaluate effectiveness of interventions. I use a hybrid directed graph social network model calibrated with conditional probabilities derived from actual data with the idea of reproducing the experience of acute outcomes reflecting undesirable individual and societal outcomes. In the present model, I also approximate geospatial effects related to transportation as well as temporal effects. Drinking behaviors among underage users can be particularly harmful from both a societal and individual perspective. Using the model based on data from experiences in Fairfax County, Virginia, I am able to reproduce the multinomial probability distribution of acute outcomes with high accuracy using a microsimulation of all residents of Fairfax, approximately 1,000,000 agents simulated. By adjusting conditional probabilities corresponding to interventions, I am able to simulate the effects of those interventions. This methodology provides an effective tool for investigating the impact of interventions and thus provides guidance for public policy related to alcohol use.



(Seminar No. 17)

SPEAKER: Dr. Philip Rosenberg
Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH
Rockville, MD 20852-4910, U.S.A.

TITLE: Proportional Hazards Models and Age-Period-Cohort Analysis of Cancer Rates

TIME AND PLACE:  Thursday, March 12, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: Age-period-cohort (APC) analysis is widely used in cancer epidemiology to model trends in cancer rates. We develop methods for comparative APC analysis of two independent cause-specific hazard rates assuming that an APC model holds for each one. We construct linear hypothesis tests to determine whether the two hazards are absolutely proportional, or proportional after stratification by cohort, period, or age. When a given proportional hazards model appears adequate, we derive simple expressions for the relative hazards using identifiable APC parameters. We also construct a linear hypothesis test to assess whether the logarithms of the fitted age-at-event curves are parallel after adjusting for possibly heterogeneous period and cohort effects, a relationship that can hold even when the expected hazard rates are not proportional. To assess the utility of these new methods, we surveyed cancer incidence rates in Blacks versus Whites for the leading cancers in the United States, using data from the National Cancer Institute's Surveillance, Epidemiology, and End Results Program. Our comparative survey identified cancers with parallel and crossing age-at-onset curves, cancers with rates that were proportional after stratification by cohort, period, or age, and cancers with rates that were absolutely proportional. Proportional hazards models provide a useful statistical framework for comparative APC analysis.



(Seminar No. 18)

SPEAKER: Dr. Hormuzd Katki
Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, DHHS
Rockville, MD 20852-4910, U.S.A.

TITLE: Insights into p-values and Bayes Factors from False Positive and False Negative Bayes Factors

TIME AND PLACE:  Thursday, March 26, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: The Bayes Factor has stronger theoretical justification than p-values for quantifying statistical evidence, but when the goal is hypothesis testing, the Bayes Factor yields no insight about false positive vs. false negative results. I introduce the False Positive Bayes Factor (FPBF) and the False Negative Bayes Factor (FNBF) and show that they are approximately the two components of the Bayes Factor. In analogy to diagnostic testing, the FPBF and FNBF provide additional insight not obvious from the Bayes Factor. FPBF & FNBF require only the p-value and the power under an alternative hypothesis, forging a new link of p-values to Bayes Factors. This link can be exploited to understand differences in inferences drawn by Bayes Factors versus p-values. In a genome-wide association study of prostate cancer, FPBF & FNBF help reveal the two SNP mutations declared positive by p-values and Bayes Factors that with future data turned out to be false positives.



(Seminar No. 20)

SPEAKER: Dr. Hiro Hikawa
Department of Statistics, George Washington University
Washington, DC 20052, U.S.A.

TITLE: Robust Peters-Belson Type Estimators of Measures of Disparity and their Applications in Employment Discrimination Cases

TIME AND PLACE:  Thursday, April 16, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: In discrimination cases concerning equal pay, the Peters-Belson (PB) regression method is used to estimate the pay disparities between minority and majority employees after accounting for major covariates (e.g., seniority, education). Unlike the standard approach, which uses a dummy variable to indicate protected group status, the PB method first fits a linear regression model for the majority group. The resulting regression equation is then used to predict the salary of each minority employee by using their individual covariates in the equation. The difference between the actual and the predicted salaries of each minority employee estimates the pay differential for that minority employee, which takes into account legitimate job-related factors. The average difference estimates a measure of pay disparity. In practice, however, a linear regression model may not be sufficient to capture the actual pay-setting practices of the employer. Therefore, we use a locally weighted regression model in the PB approach as a specific functional form of the relationship between pay and relevant covariates is no longer needed. The statistical properties of the new procedure are developed and compared to those of the standard methods. The method also extends to the case with a binary (1-0) response, e.g., hiring or promotion. Both simulation studies and re-analysis of actual data show that, in general, the locally weighted PB regression method reflects the true mean function more accurately than the linear model, especially when the true function is not a linear or logit (for a 1-0 response) model. Moreover, only a small loss of efficiency is incurred when the true relation follows a linear or logit model.



(Seminar No. 21)

SPEAKER: Dr. Tsong Yi (Bio)
Division of Biometrics VI, OB/OTS/CDER, FDA
Silver Spring, MD 20993, U.S.A.

TITLE: Multiple Testing Issues in Thorough QTc Clinical Trials

TIME AND PLACE:  Thursday, April 23, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: Clinical trial endpoint often measured repeatedly at multiple time points with the objective to show either that the test treatment is more effective than control treatment at at-least one time point or to show that it is more effective than control treatment at all time points. With either objective, it involves with multiple comparisons and the issues of type I error rate control. We illustrate the problem with the example of thorough QT clinical trials. The ICH E14, 2005 defined that drug-induced prolongation of QT interval as evidenced by an upper bound of the 95% confidence interval around the mean effect on QTc of 10 ms. Further more it defined that a negative thorough QT/QTc study is one in which the upper bound of the 95% one-sided confidence interval for the largest time-matched mean effect of the drug on the QTc interval excludes 10 ms. It leads to the requirement of showing non-inferiority of the test treatment to placebo at multiple time points. Conventionally, it is carried out by testing multiple hypotheses at 5% type I error rate each. The multiple comparison concern of this analysis is conservativeness when the number of tests is many. On the other hand, when the study result is negative, ICH E14 recommended to validate the negative result by showing that the study population is sensitive enough to show at least 5 ms prolongation of QTc interval of a carefully selected positive control. The validation test is often carried out by demonstrating that the mean difference between positive control and placebo is greater than 5 ms at at-least one of the selected few time points. The multiple comparison nature of the validation test led to the concerns of type I error rate inflation. Both of the multiple comparison issue can be represented by the biasness of using the maximum of the estimates of treatment difference as the estimate of the maximum of the expected differences. We will discuss a few proposed approaches to address the problem.



(Seminar No. 22)

SPEAKER: Dr. Alan Dorfman Dr. Alan Dorfman
Bureau of Labor Statistics, U.S. Department of Labor
NE Washington, DC 20212-0001, U.S.A.

TITLE: Nonparametric Regression and the Two Sample Problem

TIME AND PLACE:  Thursday, April 30, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT: The two sample problem: two distinct surveys gather information on a variable y of interest from a single frame, differing perhaps in sample design and sample size, but with common auxiliary information x. How should we combine the data from the surveys to get a single estimate? Nonparametric regression: Models are often used in survey sampling to sharpen inference on y based on more complete knowledge of an auxiliary variable x. Because of the tentativeness of models in most circumstances, samplers typically buttress their model-based inference by embedding it in a design-based framework ("model assisted" estimation). An alternate approach is to use very weak models and nonparametric regression. A simple two sample problem is described and several approaches to handling it described. A simple, somewhat disguised version of nonparametric regression provides a nice solution. Some problematic and controversial aspects of nonparametric regression in survey sampling are discussed.



( Seminar No. 23)

SPEAKER: Prof. Andrew J. Waters
Uniformed Services University of the Health Sciences
Bethesda, MD 20814, U.S.A.

TITLE: Using ecological momentary assessment to study relapse in addiction

TIME AND PLACE:  Thursday, May 7, 2009, 3:30pm
             Room 1313, Math Bldg

ABSTRACT:

Rationale There has been growing interest in the use of handheld computers (PDAs) to collect behavioral data in a naturalistic or Ecological Momentary Assessment (EMA) setting. In many EMA studies, participants carry around a PDA with them as they go about their daily lives. They are beeped at random times on 4 or 5 occasions per day. When beeped, they complete items assessing subjective and contextual variables. Because each participant typically completes a fairly large number of assessments, EMA studies can generate large and complex datasets. The talk will first provide an overview of how EMA methods have been used to study addiction. I will also discuss a number of studies in which implicit cognitive assessments (reaction time tasks) have been administered on a PDA in an EMA setting. In an initial study, twenty-two smokers and 22 non-smokers carried around a PDA for 1-week (Waters & Li, 2008). They were beeped at random times on 4 occasions per day (RAs). At each assessment, participants responded to items assessing subjective, pharmacological, and contextual variables. They subsequently completed a Stroop task. In a second study, 30 participants completed an Implicit Association Test (IAT) at each assessment. In a third study, 68 heroin abusers undergoing drug detoxification in a detoxification clinic completed implicit/explicit cognitive assessments at each assessment. In a fourth study, 81 participants wishing to quit smoking have carried around a PDA for 1-week after their quit date. The talk will address: 1) The feasibility of assessing implicit/explicit cognitions on PDAs in an EMA setting; 2) The statistical methods that have been employed to analyze the EMA data; and 3) The unique associations between implicit/explicit cognitions and temptations/relapse that have been revealed in EMA data.



(Fall 2009, Seminar No. 1)

SPEAKER: Prof. Nikolai Chernov
University of Alabama at Birmingham,
Birmingham, AL, 35294, U.S.A.

TITLE: Errors-in-variables regression models: Parameter estimates often have no moments

TIME AND PLACE:  Thursday, September 10, 2009, 3:30pm
              Room 1313, Math Bldg

ABSTRACT: In the studies of linear and nonlinear regression problems when both variables are subject to errors, it often happens that maximum likelihood estimates have infinite mean values and infinite variances, but they work well in practice. I will discuss these facts and their methodological implications.




Last updated May 19, 2009