Methods for “A Fast Ecological Inference Algorithm for the RxC case”.

The following library includes a method (run_em) to solve the R×C Ecological Inference problem for the non-parametric case by using the EM algorithm with different approximation methods for the E-Step. The standard deviation of the estimated probabilities can be computed using bootstrapping (bootstrap).

It also provides a function that generates synthetic election data (simulate_election) and a function that imports real election data (chilean_election_2021) from the Chilean first-round presidential election of 2021.

The setting in which the documentation presents the Ecological Inference problem is an election context where for a set of ballot-boxes we observe (i) the votes obtained by each candidate and (ii) the number of voters of each demographic group (for example, these can be defined by age ranges or sex). See Thraves, C., Ubilla, P., Hermosilla, D. (2024): “A Fast Ecological Inference Algorithm for the R×C Case”..

The methods to compute the conditional probabilities of the E-Step included in this package are the following:

On average, the Multinomial method is the most efficient and precise. Its precision matches the Exact method.

The documentation uses the following notation:

To learn more about fastei, please consult the available vignettes:

browseVignettes("fastei")

Aditionally, it is possible to browse the full documentation on the library website. ### Installation

As of now, it can only be installed from source. Support for Fortran is needed, however, R usually ships with it already. OpenMP is optional but highly suggested. The Makevars can be usually found on ~.R/Makevars, where it is possible to add the corresponding/missing flags.