What’s in the data viewer
|The data viewer displays a maximum of 10,000 rows. If a dataset has more than 10,000 rows, you’ll need to download it to view all rows.|
When you open a dataset, it’s displayed in the data viewer.
The default view includes:
The metadata associated with this dataset (description, time last updated, and number of rows and columns).
The collection path, showing the location of the dataset within the collection hierarchy.
The data records within the selected dataset.
Buttons to access additional options:
- The Statistics button provides information about each column (see Viewing statistics).
- The Visualize in Tableau button displays information about using the Enigma Public Web Data Connector to import the current dataset into Tableau.
The Export and API buttons let you download the current dataset or view snapshot information in JSON format (see Downloading data).
You must be signed into Enigma Public to use the Tableau, Export and API features.
Tabs for selecting different views of the metadata:
- Overview: The default view described above
- Lineage: The source URL and the date and time the dataset was created and last updated (if applicable)
- Fields: See Viewing field information
Viewing field information
Clicking the Fields tab displays information about the fields in the current dataset.
On the left of the screen, you’ll see:
The field name (click the name to scroll to that column if it isn’t already visible)
The data type (text, date, or numeric)
A description of the field
If you’re using a dataset that does not include field descriptions and you’re interested in annotating the dataset to help other Enigma Public users, contact firstname.lastname@example.org.
Working with cell data
You can copy the information from one or more cells by first selecting the cells by clicking or dragging, and then pressing Ctrl-C (PC) or Command-C (Mac). You can then paste the text into another application.
Right-clicking on a cell lets you filter data or launch a global search based on the cell value. For details, see Filtering and searching on a cell value.
To hide columns you don’t want to see, click the Hide button in the column header.
When you hide a column, a narrow spacer remains in its place. Click the spacer to restore the column.
By default, rows are displayed in the order they appear in the underlying data file. You can sort the rows in any order by clicking the Sort button at the top of the column you want to use. When you click the button, choose Descending to sort the rows in descending order, or Ascending to sort the rows in ascending order.
When you’ve selected a column for sorting, you’ll see the corresponding sort icon displayed in the column heading.
The Statistics button lets you view statistics for each of the columns based on the data in the rows in the current view. In the example below, the rows are filtered to display only rows where Oil (bbls) is greater than or equal to 100,000 and there are 1,130 matching rows . The statistics are for those rows.
Statistics are displayed for each field (column) and are for the “matching rows” only – not the entire dataset. If you want statistics for the entire dataset, remove (‘x’) each of the applied filters.
The information displayed above each column depends on the data type associated with the field (text, date, or numeric):
- For text fields, you’ll see frequency information. In the example, the first column is a text field and there are 12 instances of “608164040500” within the column. Click an item to filter the dataset to display only rows with the selected value in that column.
- For date fields, you’ll also see frequency information. In the example, the second column is a date field and there are 111 instances of “2014-07-01” within the column.
- For numeric fields, you’ll see calculated statistics for the column. In the example, the third column is a numeric field where the sum of the row values is 269,672,669.
To close the summary view, click the Statistics button again.
Viewing a dataset as of a previous time
When you open a dataset in Enigma Public, you’ll see the most recent or “current” snapshot (see How data is organized). If the dataset has earlier snapshots, Enigma Public lets you select the version that was “current” as of a specified date and time in the past. For example, if you specify January 1, 2018 at 0:00:00, you’ll see the snapshot that was the latest on that date and time.
To view a previous snapshot:
- Open the dataset in the data viewer.
- Click the “As of” field beneath the dataset name.
- Select a date and time:
- To change the month or year, click the current month or year and select from the list, or use the or buttons to go back or advance by a month. Then click the date.
- To change the time, type it in using 24-hour clock format.
- Click Select Date. You’ll see the date and time of the snapshot you’re viewing displayed below the “As of” field.