What’s in the data viewer
When you open a dataset, it’s displayed in the data viewer.
The default view includes:
The metadata associated with this dataset (description, time last updated, and number of rows and columns).
The collection path, showing the location of the dataset within the collection hierarchy.
The data records within the selected dataset.
The Statistics button provides information about each column (see Viewing statistics), while the Export and API buttons let you download the current dataset or view snapshot information in JSON format (see Downloading data). You must be signed in to Enigma Public to use the Export and API features.
Tabs for selecting different views of the metadata:
- Overview: The default view described above
- Lineage: The source URL and the date and time the dataset was created and last updated (if applicable)
- Fields: See Viewing field information
|The data viewer displays a maximum of 10,000 rows. If a dataset has more than 10,000 rows, you’ll need to download it to view all rows.|
Viewing field information
Clicking the Fields tab displays information about the fields in the current dataset.
On the left of the screen, you’ll see:
The field name (click the name to scroll to that column if it isn’t already visible)
The data type (text, date, or numeric)
A description of the field
If you’re using a dataset that does not include field descriptions and you’re interested in annotating the dataset to help other Enigma Public users, contact firstname.lastname@example.org.
Working with cell data
You can copy the information from one or more cells by first selecting the cells by clicking or dragging, and then pressing Ctrl-C (PC) or Command-C (Mac). You can then paste the text into another application.
Right-clicking on a cell lets you filter data or launch a global search based on the cell value. For details, see [Filtering and searching on a cell value][public_v20_user_filtering#filter_search_cell].
To hide columns you don’t want to see, click the Hide button in the column header.
When you hide a column, a narrow spacer remains in its place. Click the spacer to restore the column.
By default, rows are displayed in the order they appear in the underlying data file. You can sort the rows in any order by clicking the Sort button at the top of the column you want to use. When you click the button, choose Descending to sort the rows in descending order, or Ascending to sort the rows in ascending order.
When you’ve selected a column for sorting, you’ll see the corresponding sort icon displayed in the column heading.
The Statistics button lets you view statistics for each of the columns based on the data in the rows in the current view. In the example below, the rows are filtered to display only rows where Oil (bbls) is greater than or equal to 100,000 and there are 1,130 matching rows . The statistics are for those rows.
Statistics are displayed for each field (column) and are for the “matching rows” only – not the entire dataset. If you want statistics for the entire dataset, remove (‘x’) each of the applied filters.
The information displayed above each column depends on the data type associated with the field (text, date, or numeric):
- For text fields, you’ll see frequency information. In the example, the first column is a text field and there are 12 instances of “608164040500” within the column.
- For date fields, you’ll also see frequency information. In the example, the second column is a date field and there are 111 instances of “2014-07-01” within the column.
- For numeric fields, you’ll see calculated statistics for the column. In the example, the third column is a numeric field where the sum of the row values is 269,672,669.
To close the summary view, click the Statistics button again.