'Microsoft Word' 'docx' files provide an 'XML' structure that is fairly
straightforward to navigate, especially for 'Word' tables and comments.
Tools are provided to determine the number and structure of tables, to count
comments, and to extract and clean tables and comments from 'Microsoft Word' 'docx' documents.
There is also nascent support for '.doc' and '.pptx' files.
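
As a rough illustration of the workflow described above, the following sketch reads a document, reports on its tables, and extracts the first table and any comments. It assumes the package is installed and that the example file `examples/data.docx` ships with it; the variable names (`path`, `doc`, `n_tbls`, `n_cmnts`, `tbl`) are illustrative only.

```r
# Minimal sketch, not a definitive recipe. Assumes docxtractr is installed
# and that an example file is bundled at examples/data.docx (an assumption).
library(docxtractr)

path <- system.file("examples/data.docx", package = "docxtractr")
stopifnot(nzchar(path))  # system.file() returns "" when the file is missing

doc <- read_docx(path)

# Table count and a printed summary of each table's layout
n_tbls <- docx_tbl_count(doc)
docx_describe_tbls(doc)

# Extract the first table as a data frame, using its first row as the header
if (n_tbls > 0) {
  tbl <- docx_extract_tbl(doc, tbl_number = 1, header = TRUE)
  print(tbl)
}

# Comments are handled the same way: count first, then extract if any exist
n_cmnts <- docx_cmnt_count(doc)
if (n_cmnts > 0) {
  print(docx_extract_all_cmnts(doc))
}
```

Checking the counts before extracting avoids errors on documents that contain no tables or comments.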
Version: 0.6.5
Depends: R (≥ 3.6.0)
Imports: tools, xml2, purrr, dplyr, utils, httr, magrittr
Suggests: covr, tinytest
Published: 2020-07-05
Author: Bob Rudis [aut, cre], Mark Dulhunty [ctb], Karlo Guidoni-Martins [ctb], Chris Muir [aut, ctb], John Muschelli [ctb]
Maintainer: Bob Rudis <bob at rud.is>
BugReports: https://gitlab.com/hrbrmstr/docxtractr/issues
License: MIT + file LICENSE
URL: http://gitlab.com/hrbrmstr/docxtractr
NeedsCompilation: no
SystemRequirements: LibreOffice (<https://www.libreoffice.org/>) required to extract data from .doc files or perform .pptx conversion.
Materials: NEWS
CRAN checks: docxtractr results