vd
Differences
This shows you the differences between two versions of the page.
vd [2021/08/09 22:58] – [Case Study: Exported CSV from PG&E] dblume | vd [2024/05/13 11:23] (current) – [Process Data] dblume | ||
---|---|---|---|
Line 25: | Line 25: | ||
| ; | Extract regex to new column. Ex, '' | | ; | Extract regex to new column. Ex, '' | ||
| %%^%% | rename the column. Might have to be " | | %%^%% | rename the column. Might have to be " | ||
+ | | = | Use Python function to create new column. Ex, hex to dec: '' | ||
+ | | : | Split column by regex | | ||
| - | Hide column | | | - | Hide column | | ||
| S | Go to " | | S | Go to " | ||
Line 34: | Line 36: | ||
| " | Open duplicate sheet with only selected rows | | | " | Open duplicate sheet with only selected rows | | ||
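The '';'', ''='' and '':'' keys above all work on one cell value per row. A minimal plain-Python sketch of what each does to a single row follows. The column names ''addr'' and ''name'' and the sample values are made up, and the exact expressions from the table are cut off in this view, so ''int(value, 16)'' is only one way to do hex to dec.

```python
import re

# Hypothetical sample row; "addr" and "name" are made-up column names.
row = {"addr": "ff", "name": "alice smith"}

# Hex to dec, the kind of Python expression "=" evaluates per row.
dec = int(row["addr"], 16)

# Split a value by a regex, as ":" does for a whole column.
parts = re.split(r"\s+", row["name"])

# Extract a regex capture group to a new value, as ";" does.
match = re.search(r"^(\w+)", row["name"])
first = match.group(1) if match else None

print(dec, parts, first)
```

In a ''='' expression, other columns are referred to by name, so the column name needs to be a valid Python identifier.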
- | ==== Case Study: Exported CSV from PG&E ==== | + | ===== Inspecting Columnar Data ===== |
+ | |||
+ | ^ Key ^ Meaning ^ | ||
+ | | I | Describe all columns, errors, distinct, mode, mean, median, stdev, etc. | | ||
+ | | i | Add a column of incrementing numbers (useful for ' | ||
+ | | . | Requires an " | ||
+ | | O | Options to enable " | ||
+ | | F | Frequency table of row counts, or histogram if numeric_binning is true | | ||
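For reference, the numbers that ''I'' and ''F'' report can be reproduced in plain Python. This is only a sketch of the same statistics on a made-up list of values, not how VisiData computes them internally.

```python
from collections import Counter
import statistics

# Hypothetical values from one numeric column.
values = [3, 1, 3, 2, 3, 1]

# Roughly what the Describe sheet ("I") summarizes for a column.
summary = {
    "distinct": len(set(values)),
    "mode": statistics.mode(values),
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "stdev": statistics.stdev(values),
}

# Roughly what a Frequency table ("F") shows: value and row count,
# most common first.
freq = Counter(values).most_common()

print(summary)
print(freq)
```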
+ | |||
+ | Calculating a percentage-of-total column for a numeric column: | ||
+ | |||
+ | ^ Key ^ Meaning ^ | ||
+ | | # | Set column type to " | ||
+ | | I | Describe all columns. (Highlight the " | ||
+ | | ~ | Convert that column to " | ||
+ | | zy | Yank the value of the sum. | | ||
+ | | q | Quit the Describe sheet. | | ||
+ | | = | New column. Enter '' | ||
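The arithmetic behind the steps above, as a plain-Python sketch. The values are made up, and ''total'' stands in for the yanked sum; the actual expression entered at the ''='' prompt is cut off in this view, so this is not a copy of it.

```python
# Hypothetical values from the numeric column.
amounts = [10, 30, 60]

# The sum that the Describe sheet reports for the column.
total = sum(amounts)

# Percentage of total for each row, i.e. value * 100 / sum.
percents = [amount * 100 / total for amount in amounts]

print(percents)
```

The percentages add up to 100 (give or take floating-point rounding), which is a quick sanity check on the new column.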
+ | ====== Case Study Link: Exported CSV from PG&E ====== | ||
Visit [[vd-pge]]. | Visit [[vd-pge]]. | ||
+ | |||
+ | ====== Case Study: Merging Two Tables, logs and metadata ====== | ||
==== Protip: Use column view to set multiple columns at once ==== | ==== Protip: Use column view to set multiple columns at once ==== | ||
Line 106: | Line 128: | ||
$ vd --play=my_cmdlog.vd --replay-wait=0.5 | $ vd --play=my_cmdlog.vd --replay-wait=0.5 | ||
+ | ====== Lists in Cells for Frequency Tables ====== | ||
+ | Sometimes you want one of the columns in a Frequency Table to be a list of unique values. Let's say the column title is " | ||
+ | |||
+ | ^ Key ^ Meaning ^ | ||
+ | | + | Set the aggregator to " | ||
+ | | F | Make a Frequency Table for the selected column. (gF for selected columns) | | ||
+ | | =, ',' |
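The result being aimed for, sketched in plain Python: one row per key, with a count and a comma-joined list of the unique values seen for that key. The column names ''region'' and ''item'' and the rows are made up, and the aggregator name in the table above is cut off, so this only shows the intended output, not the VisiData mechanism.

```python
from collections import defaultdict

# Hypothetical rows: (region, item). "region" is the column being counted.
rows = [
    ("east", "apple"),
    ("east", "pear"),
    ("west", "apple"),
    ("east", "apple"),
]

counts = defaultdict(int)    # row count per region
uniques = defaultdict(list)  # unique items per region, in first-seen order

for region, item in rows:
    counts[region] += 1
    if item not in uniques[region]:
        uniques[region].append(item)

# One frequency-table row per region: count plus a comma-joined list cell.
table = {region: (counts[region], ", ".join(uniques[region])) for region in counts}

print(table)
```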
vd.1628575098.txt.gz · Last modified: 2023/04/12 20:44 (external edit)