MRMD2.0 feature ranking and reduction
Manual
1. Installation:
pip3 install -r requirements.txt --ignore-installed
2. usage:
python3 mrmd2.0.py -i input.csv -s start_index -e end_index -l Step_length -o metrics.csv -c Dimensionalized_dataset.csv
note: The program can select a certain interval of the data set feature sequence for dimensionality reduction, so you need to select the specified parameter -s, -e. If you want to reduce the dimension of the entire data set, you only need to specify -s 1 , -e -1
-
-i the input dataset, supports csv,arff and libsvm
-
-s the location where the user specified interval begins (default 1)
-
-e User-specified interval ending position (default -1)
-
-l step length (default 1,Larger steps will execute faster, and smaller results will be better.)
-
-o Some indicators of the dimensionality reduction data set
-
-b classifier, default=1, RrandomForest=1, SVM=2, Bayes=3
-
-r rank_method, default=1, PageRank = 1,HITS:Authority = 2,HITS:Hub = 3,LeaderRank = 4,TrustRank=5
-
-c Dimensionalized data set
The data output by the terminal can be found in the Logs directory. Please find the results in 'Results' folder.
3. Example
- Test.csv is a 150-dimensional dataSet
- First select a dimension reduction interval (here from the first feature to the 150th feature, that is, the dimension reduction of the entire feature data set, of course, you can also choose one of the other continuous feature intervals)
- Step size is set to 1
python3 mrmd2.0.py -i test.csv -o metrics.csv -c Dimensionalized_dataset.csv
python3 mrmd2.0.py -i test.arff -o metrics.csv -c Dimensionalized_dataset.arff
python3 mrmd2.0.py -i test.libsvm -o metrics.csv -c Dimensionalized_dataset.libsv