by Diego Calvo | Sep 4, 2018 | Apache Hadoop, Big Data
Kerberos definition
Kerberos is an authentication protocol that allows two computers to prove their identity to each other securely. It is implemented on a client-server architecture and works on the basis of tickets that serve to demonstrate the identity of the...
by Diego Calvo | Sep 4, 2018 | Big Data
Access the cluster by SSH
ssh user_name@server_cluster_name
Authentication in the shell
kinit user_name@REINO.COM
If authentication is successful, we receive a ticket-granting ticket (TGT) from the KDC. This means that we have authenticated with the server, but...
by Diego Calvo | Aug 27, 2018 | Apache Spark, Big Data, Scala-example
Filter data with like
The filter selects the people whose surname contains “Garc” and whose age is under 30.
val df = sc.parallelize(Seq( ("Paco","Garcia",24,24000,"2018-08-06 00:00:00"),...
by Diego Calvo | Aug 19, 2018 | Big Data, Data bases
Prerequisites of Apache Sqoop Examples
The prerequisites for these examples are the same as for the previous post on Sqoop. These examples create a database “myddbb”, a table “mytable” populated with values, and another empty table...
by Diego Calvo | Aug 17, 2018 | Apache Spark, Big Data, Scala-example
The following post shows the steps to recreate an example of linear regression in Scala.
Set the data set
Defines the set of data to apply to the model.
import org.apache.spark.ml.linalg.Vectors
val df = spark.createDataFrame(Seq( (0, 60), (0, 56), (0, 54), (0, 62),...
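Since the excerpt stops at the data definition, here is a minimal sketch of how a linear regression over data of this shape might be fitted with Spark ML. It is not the post's own code. The first four rows come from the excerpt; the remaining rows, the column names `x` and `y`, the choice of which column is the label, and the local `SparkSession` are all assumptions made for illustration.

```scala
import org.apache.spark.ml.feature.VectorAssembler
import org.apache.spark.ml.regression.LinearRegression
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

object LinearRegressionSketch {
  def main(args: Array[String]): Unit = {
    // Local session for experimentation (assumption; a cluster would supply its own).
    val spark = SparkSession.builder()
      .appName("linear-regression-sketch")
      .master("local[*]")
      .getOrCreate()

    // First four rows are from the excerpt; the last four are hypothetical.
    // Column names "x" (feature) and "y" (label) are assumptions.
    val df = spark.createDataFrame(Seq(
      (0, 60), (0, 56), (0, 54), (0, 62),
      (1, 61), (1, 63), (1, 65), (1, 67)
    )).toDF("x", "y")
      .select(col("x").cast("double").as("x"), col("y").cast("double").as("y"))

    // Spark ML estimators expect the features packed into a single vector column.
    val assembled = new VectorAssembler()
      .setInputCols(Array("x"))
      .setOutputCol("features")
      .transform(df)

    // Fit an ordinary linear regression of y on x.
    val model = new LinearRegression()
      .setFeaturesCol("features")
      .setLabelCol("y")
      .fit(assembled)

    println(s"coefficients = ${model.coefficients}, intercept = ${model.intercept}")

    spark.stop()
  }
}
```

In a `spark-shell` session the `spark` value already exists, so only the body of `main` from the `df` definition onward would be needed.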