How to handle messy CSV files
CleverCSV is a Python package for detecting the dialect of CSV files with high accuracy. It is modeled on how a human would determine the dialect: by looking for patterns that result in a regular tabular structure with “clean data” in the cells (such as numbers, dates, etc.).
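To make the "regular table with clean cells" idea concrete, here is a toy sketch. It is not CleverCSV's actual scoring: the names `toy_score` and `toy_detect`, the candidate delimiters, and the `CLEAN_CELL` pattern are all made up for illustration. It simply rewards delimiters that give rows of equal width and cells that look like numbers, dates, or plain words.

```python
import re
from collections import Counter

# Hypothetical notion of a "clean" cell: a number (with . or , as the decimal
# mark), an ISO date, or a plain word. Real type detection is far richer.
CLEAN_CELL = re.compile(r"\d+([.,]\d+)?|\d{4}-\d{2}-\d{2}|[A-Za-z]+")


def toy_score(text, delimiter):
    """Score a delimiter by row regularity times cell cleanliness."""
    rows = [line.split(delimiter) for line in text.splitlines() if line]
    if not rows:
        return 0.0
    # How many rows share the most common number of fields?
    width, count = Counter(len(row) for row in rows).most_common(1)[0]
    if width < 2:
        return 0.0  # the delimiter never split anything, so there is no table
    regularity = count / len(rows)
    cells = [cell for row in rows for cell in row]
    cleanliness = sum(bool(CLEAN_CELL.fullmatch(c)) for c in cells) / len(cells)
    return regularity * cleanliness


def toy_detect(text, candidates=",;\t|"):
    """Return the candidate delimiter with the highest toy score."""
    return max(candidates, key=lambda d: toy_score(text, d))


SAMPLE = "name;price;date\napple;1,50;2020-01-31\npear;0,75;2020-02-01\n"

if __name__ == "__main__":
    print(repr(toy_detect(SAMPLE)))
```

In this sample the comma appears in every data row, but splitting on it produces ragged rows and cells like `apple;1`, so the semicolon scores higher.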
The package is designed to help a data scientist with messy CSV files. It also provides a handy command line tool that can standardize a messy file or generate Python code to import it.
Installation
pip install clevercsv[full]
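Once the package is installed, a detect-then-read flow might look like the sketch below. The helper `read_messy_csv` and the sample text are made up for illustration. The sketch assumes CleverCSV mirrors the standard library's `csv` interface (`Sniffer().sniff(...)` and `reader(...)`), and it falls back to the built-in `csv` module if `clevercsv` cannot be imported, so the snippet runs either way.

```python
import io

try:
    import clevercsv as csv_backend  # preferred: CleverCSV's dialect detection
except ImportError:
    import csv as csv_backend  # fallback: standard library, same interface


def read_messy_csv(text):
    """Detect the dialect of `text` and return (delimiter, rows)."""
    dialect = csv_backend.Sniffer().sniff(text)
    if dialect is None:
        # Assumption: the sniffer may return None when nothing is detected.
        raise ValueError("could not detect a CSV dialect")
    reader = csv_backend.reader(io.StringIO(text), dialect)
    rows = [row for row in reader if row]  # skip any empty rows
    return dialect.delimiter, rows


# Illustrative sample only: a small semicolon-separated file.
SAMPLE = "id;name;score\n1;alice;3.5\n2;bob;4.0\n3;carol;2.5\n"

if __name__ == "__main__":
    delimiter, rows = read_messy_csv(SAMPLE)
    print(repr(delimiter))
    for row in rows:
        print(row)
```

For a file on disk, the same pattern applies: read the text, sniff it, then pass the detected dialect to the reader.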
GitHub | Docs
[Research] Wrangling Messy CSV Files by Detecting Row and Type Patterns
#clevercsv #csv