Estimation of tail parameters with missing largest observations

Research output: Contribution to journalJournal articleResearchpeer-review

Documents

  • Fulltext

    Final published version, 4.97 MB, PDF document

The setting where an unknown number m of the largest data is missing from an underlying Pareto-type distribution is considered. So-lutions are provided for estimating the extreme value index, the number of missing data and extreme quantiles. Asymptotic results of the parameter estimators and an adaptive selection method for the number of top data used in the estimation are proposed for the case where all missing data are beyond the observed data. An estimator of the number of missing extremes spread over the largest observed data is also proposed. To this purpose, a key component is a likelihood solution based on exponential representations of spacings between the largest observations. An effective and fast optimization procedure is established using regularization, and simulation experiments are provided. The methodology is illustrated with a dataset from the diamond mining industry, where large-carat diamonds are expected to be missing.

Original languageEnglish
JournalElectronic Journal of Statistics
Volume17
Issue number2
Pages (from-to)3728-3761
ISSN1935-7524
DOIs
Publication statusPublished - 2023

Bibliographical note

Publisher Copyright:
© 2023, Institute of Mathematical Statistics. All rights reserved.

    Research areas

  • Extreme value index, high quantiles, missing observations, regularization

ID: 380304082