Modern Scala projects : leverage the power of Scala for building data-driven and high-performant projects
Record details
- ISBN: 9781788624114
- ISBN: 1788624114
- ISBN: 9781788624114
- ISBN: 1788625277
- ISBN: 9781788625272
-
Physical Description:
1 online resource : illustrations
remote - Publisher: Birmingham, UK : Packt Publishing, 2018.
Content descriptions
Bibliography, etc. Note: | Includes bibliographical references. |
Formatted Contents Note: | Implementation objective 2- deriving a dataframe for EDAStep 1 -- conducting preliminaryEDA; Step 2 -- loading data and converting it to an RDD[String]; Step 3 -- splitting the resilient distributed dataset and reorganizing individual rows into an array; Step 4 -- purging the dataset of rows containing question mark characters; Step 5 -- running a count after purging the dataset of rows with questionable characters; Step 6 -- getting rid of header; Step 7 -- creating a two-column DataFrame; Step 8 -- creating the final DataFrame; Random Forest breast cancer pipeline |
Source of Description Note: | Online resource; title from title page (Safari, viewed August 27, 2018). |
Search for related items by subject
Genre: | electronic book > ebook |