R interface for the Google Cloud Services 'Document AI API' <https://cloud.google.com/document-ai/> with additional tools for output file parsing and text reconstruction. 'Document AI' is a powerful server-based OCR service that extracts text and tables from images and PDF files with high accuracy. 'daiR' gives R users programmatic access to this service and additional tools to handle and visualize the output. See the package website <https://dair.info/> for more information and examples.
| Version: | 1.0.1 |
| Depends: | R (≥ 4.2.0) |
| Imports: | base64enc, beepr, cli, data.table, fs, gargle, glue, googleCloudStorageR, graphics, grDevices, httr, jsonlite, lifecycle, magick, pdftools, purrr, readtext, stats, stringr, utils, xml2 |
| Suggests: | knitr, ngram, rmarkdown, testthat (≥ 3.1.10) |
| Published: | 2024-11-13 |
| DOI: | 10.32614/CRAN.package.daiR |
| Author: | Thomas Hegghammer |
| Maintainer: | Thomas Hegghammer <hegghammer at gmail.com> |
| BugReports: | https://github.com/Hegghammer/daiR/issues |
| License: | MIT + file LICENSE |
| URL: | https://github.com/Hegghammer/daiR, https://dair.info |
| NeedsCompilation: | no |
| Materials: | README, NEWS |
| CRAN checks: | daiR results |
| Reference manual: | daiR.html , daiR.pdf |
| Vignettes: |
Complex file and folder management (source, R code) Configuration (source, R code) Working with Google Cloud Storage (source, R code) Quickstart (source, R code) Correcting text output (source, R code) Extracting tables (source, R code) Basic usage (source, R code) |
| Package source: | daiR_1.0.1.tar.gz |
| Windows binaries: | r-devel: daiR_1.0.1.zip, r-release: daiR_1.0.1.zip, r-oldrel: daiR_1.0.1.zip |
| macOS binaries: | r-release (arm64): daiR_1.0.1.tgz, r-oldrel (arm64): daiR_1.0.1.tgz, r-release (x86_64): daiR_1.0.1.tgz, r-oldrel (x86_64): daiR_1.0.1.tgz |
| Old sources: | daiR archive |
Please use the canonical form https://CRAN.R-project.org/package=daiR to link to this page.