Fits topic models using varimax-rotated principal component analysis (PCA), following the "vintage factor analysis" approach of Rohe & Zheng (2020) <doi:10.48550/arXiv.2004.05387>. Leverages truncated PCA via 'irlba' for sparse matrices, enabling fast model fitting on large corpora. Includes an information-theoretic approach to vocabulary selection, 'broom'-compatible tidiers for extracting word-topic and topic-document matrices into a tidy data workflow, and samplers for constructing simulated corpora for benchmarking and method evaluation.
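As a rough illustration of what "varimax-rotated PCA" means here, the following base R sketch runs a truncated SVD on a small simulated document-term matrix and rotates the term loadings with `stats::varimax()`. It is not tmfast's implementation or API: the toy data, the choice `k = 2`, and all variable names are assumptions made for this example, and base `svd()` stands in for the 'irlba' truncated PCA that the package uses on sparse matrices.

```r
# Sketch only: varimax-rotated PCA on a toy document-term matrix (base R).
set.seed(42)

# Hypothetical toy corpus: 6 documents x 8 terms with two planted themes
# (docs 1-3 favor terms 1-4, docs 4-6 favor terms 5-8).
rates <- rbind(
  matrix(rep(c(5, 5, 5, 5, 0.2, 0.2, 0.2, 0.2), 3), nrow = 3, byrow = TRUE),
  matrix(rep(c(0.2, 0.2, 0.2, 0.2, 5, 5, 5, 5), 3), nrow = 3, byrow = TRUE)
)
dtm <- matrix(rpois(length(rates), rates), nrow = nrow(rates))
dimnames(dtm) <- list(paste0("doc", 1:6), paste0("term", 1:8))

k <- 2  # number of components to keep (assumed for this example)

# PCA via SVD of the column-centered matrix, keeping the leading k components.
centered <- scale(dtm, center = TRUE, scale = FALSE)
decomp <- svd(centered, nu = k, nv = k)

# Term loadings: right singular vectors scaled by their singular values.
loadings <- decomp$v %*% diag(decomp$d[1:k], k, k)
rownames(loadings) <- colnames(dtm)

# Varimax finds an orthogonal rotation that pushes loadings toward
# a sparse, more interpretable pattern.
rotated <- stats::varimax(loadings, normalize = FALSE)

term_loadings <- unclass(rotated$loadings)   # terms x components
doc_scores <- decomp$u %*% rotated$rotmat    # documents x components
rownames(doc_scores) <- rownames(dtm)

print(round(term_loadings, 2))
print(round(doc_scores, 2))
```

Because the rotation is orthogonal, `doc_scores %*% t(term_loadings)` reproduces the same rank-`k` approximation of the centered matrix as the unrotated decomposition; only the axes used to describe it change. On a real corpus the matrix would be sparse and much larger, which is where a truncated solver such as 'irlba' matters.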
| Version: | 0.1.1 |
| Depends: | R (≥ 4.1.0) |
| Imports: | assertthat, purrr, dplyr, tidyr, magrittr, rlang, stringr, tibble, tidyselect, irlba, tidytext, glue, Matrix, generics, psych, cli |
| Suggests: | testthat (≥ 3.0.0), knitr, rmarkdown, ggbeeswarm, ggplot2, Rtsne, umap, lpSolve, janeaustenr, stm, tictoc, furrr, reshape2, tmfast.realbooks |
| Published: | 2026-05-30 |
| DOI: | 10.32614/CRAN.package.tmfast (may not be active yet) |
| Author: | D. Hicks |
| Maintainer: | D. Hicks <hicks.daniel.j at gmail.com> |
| BugReports: | https://github.com/dhicks/tmfast/issues |
| License: | GPL (≥ 3) |
| URL: | https://dhicks.github.io/tmfast/, https://github.com/dhicks/tmfast |
| NeedsCompilation: | no |
| Additional_repositories: | https://dhicks.github.io/drat/ |
| Materials: | README, NEWS |
| CRAN checks: | tmfast results |
| Reference manual: | tmfast.html, tmfast.pdf |
| Vignettes: | Fast topic modeling with real books (source, R code); Fitting topic models with tmfast (source, R code) |
| Package source: | tmfast_0.1.1.tar.gz |
| Windows binaries: | r-devel: not available, r-release: not available, r-oldrel: not available |
| macOS binaries: | r-release (arm64): tmfast_0.1.1.tgz, r-oldrel (arm64): tmfast_0.1.1.tgz, r-release (x86_64): tmfast_0.1.1.tgz, r-oldrel (x86_64): tmfast_0.1.1.tgz |
Please use the canonical form https://CRAN.R-project.org/package=tmfast to link to this page.