<?xml version="1.0" encoding="UTF-8"?>
<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:title>Forest Discretization</dc:title>
  <dc:title>R package ForestDisc version 0.1.0</dc:title>
  <dc:description>Supervised, multivariate, and non-parametric discretization algorithm based on tree ensemble learning and moment matching optimization. This version of the algorithm relies on the random forest algorithm to learn a large set of split points that preserves the relationship between the attributes and the target class, and on moment matching optimization to reduce this set to a smaller number of cut points that match the statistical properties of the initial set of split points as closely as possible. For each attribute to be discretized, the set S of its split points extracted through random forest is mapped to a reduced set C of cut points of size k. This mapping minimizes, for each continuous attribute to be discretized, the distance between the first four moments of S and the first four moments of C, subject to some constraints. This non-linear optimization problem is solved for k values ranging from 2 to 'max_splits', and the solution returned is the one for the value of k whose optimum objective value is the lowest across these runs. ForestDisc is a generalization of the RFDisc discretization method initially proposed by Berrado and Runger (2009) &lt;doi:10.1109/AICCSA.2009.5069327&gt;, and improved by Berrado et al. in 2012 by adopting the idea of moment matching optimization described by Hoyland and Wallace (2001) &lt;doi:10.1287/mnsc.47.2.295.9834&gt;.</dc:description>
  <dc:type>Software</dc:type>
  <dc:relation>Imports: randomForest, nloptr, moments, stats</dc:relation>
  <dc:creator>Haddouchi Maïssae &lt;maissaem7@gmail.com&gt;</dc:creator>
  <dc:publisher>Comprehensive R Archive Network (CRAN)</dc:publisher>
  <dc:contributor>Haddouchi Maïssae</dc:contributor>
  <dc:rights>GPL (&gt;= 3)</dc:rights>
  <dc:date>2020-03-19</dc:date>
  <dc:format>application/tgz</dc:format>
  <dc:identifier>https://CRAN.R-project.org/package=ForestDisc</dc:identifier>
  <dc:identifier>doi:10.32614/CRAN.package.ForestDisc</dc:identifier>
</oai_dc:dc>
