mzutils: Toolkit Functions to Make Life Easier

mzutils is a personal toolkit that contains various methods to do miscellaneous jobs related to data cleaning, preprocessing, visualizing, evaluating and modeling in various fields of machine learning, especially NLP and RL. It supports Datasets like SQuAD, GLUE, Deepmind CNN/DailyMail in NLP and d4rl in offlineRL, as well as their conversions to JSON, CSV, numpy arrays and tensors.

The purpose of this toolkit is to reuse the code and make Mohan Zhang’s life easier.

Read the documentation here!


  • if you want to use only the minimal functionalities, please do

$ pip install -U mzutils
  • or if you want to use nlp specific functions, please do

$ pip install -U mzutils[nlp]
  • or if you want to use all the functionalities, please do

$ pip install -U mzutils[all]
$ pip install -r security_requirements.txt


due to this unreasonable feature of pip, you will need to check and install from requirements.txt, extras_requirements.txt or security_requirements.txt according to your specific need.


Indices and tables