Cluster Creation in HDInsight - Number of Nodes

  • Question

  • Hi,

    We are trying to do statistical analysis (logistic regression, linear regression, decision trees, etc.) on a large volume of data using an R Server cluster on HDInsight.

    What factors should one consider when deciding the number of nodes at cluster creation? What are the best practices?

    How does the volume of data determine the number of nodes?

    Please help.



    Thursday, August 3, 2017 10:55 AM