The fuzzywuzzyR package is a fuzzy string matching implemenation of the fuzzywuzzy python package. It uses the Levenshtein Distance to calculate the differences between sequences. More details on the functionality of fuzzywuzzyR can be found in the package Vignette.
Python (>= 2.4)
difflib
fuzzywuzzy ( >=0.15.0 )
python-Levenshtein ( >=0.12.0, optional, provides a 4-10x speedup in String Matching, though may result in differing results for certain cases)
Before the installation of any python modules one should check the python-configuration using :
reticulate::py_config()
All modules should be installed in the default python configuration (the configuration that the R-session displays as default), otherwise errors will occur during package installation.
Python2
sudo apt-get install python-pip
sudo pip install --upgrade pip
pip install fuzzywuzzy
pip install python-Levenshtein
Python 3
sudo apt-get install python3-pip
sudo pip3 install --upgrade pip
pip3 install fuzzywuzzy
pip3 install python-Levenshtein
sudo easy_install pip
sudo pip install fuzzywuzzy
sudo pip install python-Levenshtein
Update of the Environment variables ( Control Panel >> System and Security >> System >> Advanced system settings >> Environment variables >> System variables >> Path >> Edit ) by adding ( for instance in case of python 2.7 ) :
C:\Python27;C:\Python27\Scripts
Open the Command prompt and use the following commands:
pip install fuzzywuzzy
pip install python-Levenshtein
To install the package from CRAN use,
install.packages('fuzzywuzzyR')
and to download the latest version from Github use the install_github function of the devtools package,
devtools::install_github(repo = 'mlampros/fuzzywuzzyR')
Use the following link to report bugs/issues,
https://github.com/mlampros/fuzzywuzzyR/issues