#br# Distributed Big Data Machine Learning Algorithms Based on Spark
(1. School of Information Engineering, Zhengzhou University, Zhengzhou 450001, China;
2. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China)
WANG Rui1, HAN Rui2, Jia Yu-xiang1. #br# Distributed Big Data Machine Learning Algorithms Based on Spark[J]. Computer and Modernization, 2018, 0(11): 119-.
[1] SAI J, WANG B, WU B. BPPGD: Budgeted parallel primal gradient descent kernel SVM on Spark[C]// IEEE International Conference on Data Science in Cyberspace(DSC). 2016:74-79.
[2] BORTHAKUR D. The Hadoop distributed file system: Architecture and design[J]. Hadoop Project Website, 2007,11(11):1-10.
[3] ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: Cluster computing with working sets[C]// Usenix Conference on Hot Topics in Cloud Computing. 2010,15(1):10.
[4] VAVILAPALLI V K, MURTHY A C,DOUGLAS C, et al. Apache Hadoop YARN: Yet another resource negotiator[C]// Proceedings of the 4th Annual Symposium on Cloud Computing. ACM, 2013:1-16.
[5] 〖KG-*3〗BORTHAKUR D. HDFS Architecture Guide[EB/OL]. Hadoop Apache Project [2018-09-07]. http://hadoop.apache.org/docs/r1.2.1/hdfs_design.html.
[6] ZAHARIA M, CHOWDHURY M, DAS T, et al. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing[C]// Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. 2012:2-2.
[7] GitHub. SWIM[EB/OL]. [2018-05-03]. https://github.com/SWIMProjectUCB/SWIM.
[8] PERCY M. Collaborative Filtering for Netflix[D]. Santa Cruz: Jack Baskin School of Engineering, 2009.
[9] KOLLIOS G, GUNOPULOS D, KOUDAS N, et al. Efficient biased sampling for approximate clustering and outlier detection in large data sets[J]. IEEE Transactions on Knowledge and Data Engineering, 2003,15(5):1170-1187.
[10]BOTTOU L. Large-scale machine learning with stochastic gradient descent[C]// Proceedings of COMPSTAT’2010. 2010:177-186.
[11]HECHT-NIELSEN R. Theory of the backpropagation neural network[C]// International Joint Conference on Neural Networks. 1989:593-605.
[12]Wikipedia. Rectifier (Neural Networks) [EB/OL]. [2018-05-03]. https://en.wikipedia.org/wiki/Rectifier_(neural_networks).
[13]Wikipedia. Sigmoid Function[EB/OL]. [2018-05-03]. https://en.wikipedia.org/wiki/Sigmoid_function.
[14]ZHU Z A, CHEN W, WANG G, et al. P-packSVM: Parallel primal gradient descent kernel SVM[C]// IEEE the 9th IEEE International Conference on Data Mining. 2009:677-686.
[15]Wikipedia. Euclidean_Distance[EB/OL]. [2018-05-03]. https://en.wikipedia.org/wiki/Euclidean_distance.
[16]Nodalpoint. Nonlinear Neural Network[EB/OL]. [2018-05-03]. https://www.nodalpoint.com/nonlinear-regression-using-spark-part-1-nonlinear-models/.
[17]Wikipedia. Residual Sum of Squares[EB/OL]. [2018-05-03]. https://en.wikipedia.org/wiki/Residual_sum_of_squares.