ucimlrepo v0.0.2 Released - Graceful Speedy Data

software-releases
r-package
Author

James Balamuta

Published

September 2, 2024

The ucimlrepo package has just received it’s second release, v0.0.2! This release focuses on improving the user experience and avoiding the wraith of CRAN when fetching datasets from the UCI Machine Learning Repository. We’ve also made some internal changes to speed up the package for large datasets.

This package is an unoffical port of the Python ucimlrepo package.

For details, see the ucimlrepo news file entry below:

ucimlrepo news file entry for version 0.0.2 (2024-09-02)

Features

  • Improved graceful errors for fetch_ucirepo() and list_available_datasets() when resources are not found/available. (#3, thanks Prof. Ripley!)
  • Speed up fetch_ucirepo() for large data frames by switching to using base functionals instead of growing a vector in a loop while sorting variable roles. (#6)

Bug fixes

  • Fixed internal subset issue with fetch_ucirepo() when metadata from the repository had whitespace characters in the variable names. (#2)

GitHub Changelog