Oracle Big Data Preparation Cloud Service allows me to quickly create a transform. Once created, transforms are viewed on their Authoring Page. The components of this page are: a Script Panel, a Recommendations panel, view of Sample Data or Metadata, and a profile drawer that can be opened or closed at any time.
The Transform Script panel contains all the actions I perform to repair and enrich my data. Initially, this panel is populated with actions that the recommendations engine has automatically applied to the data set. I can review these actions and remove any that I don’t want in my transform script. As I apply further recommendations and perform tasks to prepare data, this panel gets populated with those actions and the update is displayed in the Sample Data instantly. I can remove an action at any point in time.
The recommendations panel contains all the actions suggested by the recommendation engine. Initially, all recommendations for the entire data set are listed. When I select a particular column from the data sheet, the recommendations panel lists suggestions only for that specific column. I can also view recommendations from the toolbar in the sample data panel. I select the recommendation I want to review and apply it if required.
Metadata is the default view for a transform. The column names and their type are listed as columns along with some sample values for the column. The status identifies the columns that have been modified or added and alerts me of columns with private or sensitive data. Each column has a context menu that allows me to perform various tasks on that column to repair or enrich it. Using the toolbar I can: change the view of the sample data to a spreadsheet view and back to the metadata view, undo or redo my actions, delete a column, or perform a duplicate analysis to identify records that have the same value for a particular column or search for a specific column name.
In the profile drawer, the first page shows a complete analysis of my entire dataset; and in the next page, I see statistics for a single column. This helps me analyze the data and prepare the information accordingly. If I perform a duplicate analysis, the results are displayed in the last page of the profile drawer. I can add multiple data files to this transform by performing a blend. After I finish editing the transform script, I save the transform and return to the Catalog page.