Fit a clustering-only bulk-tail model
dpmgpd.cluster.RdVariant of dpmix.cluster() that augments the cluster kernel with a generalized Pareto tail.
This is the clustering analogue of the spliced bulk-tail workflow used by dpmgpd().
Arguments
- formula
Model formula. The response must be present in
data.- data
Data frame containing the response and optional predictors.
- type
Clustering mode:
"weights": links mixture weights to predictors"param": links kernel parameters to predictors"both": links both weights and kernel parameters to predictors
- default
Default mode used when
typeis omitted.- mcmc
MCMC control list passed into the cluster bundle.
- ...
Additional arguments passed to
build_cluster_bundle(), including kernel settings, prior overrides, component counts, and monitoring controls.
Details
For observations above a component-specific threshold, the component density is spliced as $$ f(y) = (1 - F_{bulk}(u)) g_{GPD}(y \mid u, \sigma_u, \xi_u), \qquad y \ge u, $$ so cluster assignment can be informed by both central behavior and tail behavior.
This interface is preferable when cluster separation is driven by upper-tail differences rather than bulk-only shape or location differences.
See also
dpmix.cluster(), predict.dpmixgpd_cluster_fit(),
dpmgpd(), sim_bulk_tail().
Other cluster workflow:
dpmix.cluster(),
plot.dpmixgpd_cluster_bundle(),
plot.dpmixgpd_cluster_fit(),
plot.dpmixgpd_cluster_labels(),
plot.dpmixgpd_cluster_psm(),
predict.dpmixgpd_cluster_fit(),
print.dpmixgpd_cluster_bundle(),
print.dpmixgpd_cluster_fit(),
print.dpmixgpd_cluster_labels(),
print.dpmixgpd_cluster_psm(),
summary.dpmixgpd_cluster_bundle(),
summary.dpmixgpd_cluster_fit(),
summary.dpmixgpd_cluster_labels(),
summary.dpmixgpd_cluster_psm()