vd
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
vd [2021/06/17 00:18] – dblume | vd [2024/05/13 11:23] (current) – [Process Data] dblume | ||
---|---|---|---|
Line 3: | Line 3: | ||
**[[https:// | **[[https:// | ||
- | ====== Tips ====== | + | ====== Source Table Recommendations |
- | + | ||
- | ===== Source Table Recommendations ===== | + | |
Gather data with more granularity than you think you need. You often need three tables: | Gather data with more granularity than you think you need. You often need three tables: | ||
Line 13: | Line 11: | ||
- **Mappings from product ID to product name**: (Maybe from platform ID or from product ID.) | - **Mappings from product ID to product name**: (Maybe from platform ID or from product ID.) | ||
- | ===== Open VisiData and Set Data Types ===== | + | ====== Open VisiData and Set Data Types ====== |
$ vd logs.csv active_devices.csv id_name.csv | $ vd logs.csv active_devices.csv id_name.csv | ||
- | ==== Set Column Types, Widths, Names, Importance ==== | + | ===== Set Column Types, Widths, Names, Importance |
^ Key ^ Meaning ^ | ^ Key ^ Meaning ^ | ||
Line 27: | Line 25: | ||
| ; | Extract regex to new column. Ex, '' | | ; | Extract regex to new column. Ex, '' | ||
| %%^%% | rename the column. Might have to be " | | %%^%% | rename the column. Might have to be " | ||
+ | | = | Use Python function to create new column. Ex, hex to dec: '' | ||
+ | | : | Split column by regex | | ||
| - | Hide column | | | - | Hide column | | ||
| S | Go to " | | S | Go to " | ||
Line 36: | Line 36: | ||
| " | Open duplicate sheet with only selected rows | | | " | Open duplicate sheet with only selected rows | | ||
- | === Protip: Use column view to set multiple columns at once === | + | ===== Inspecting Columnar Data ===== |
+ | |||
+ | ^ Key ^ Meaning ^ | ||
+ | | I | Describe all columns, errors, distinct, mode, mean, median, stdev, etc. | | ||
+ | | i | Add a column of incrementing numbers (useful for ' | ||
+ | | . | Requires an " | ||
+ | | O | Options to enable " | ||
+ | | F | Frequency table of row counts, or histogram if numeric_binning is true | | ||
+ | |||
+ | Calculating a percentage-of-total column for a numeric column: | ||
+ | |||
+ | ^ Key ^ Meaning ^ | ||
+ | | # | Set column type to " | ||
+ | | I | Describe all columns. (Highlight the " | ||
+ | | ~ | Convert that column to " | ||
+ | | zy | Yank the value of the sum. | | ||
+ | | q | Quit the Describe sheet. | | ||
+ | | = | New column. Enter '' | ||
+ | ====== Case Study Link: Exported CSV from PG&E ====== | ||
+ | |||
+ | Visit [[vd-pge]]. | ||
+ | |||
+ | ====== Cast Study: Merging Two Tables, logs and metadata ====== | ||
+ | |||
+ | ==== Protip: Use column view to set multiple columns at once ==== | ||
^ Key ^ Meaning ^ | ^ Key ^ Meaning ^ | ||
Line 53: | Line 77: | ||
{{: | {{: | ||
- | ===== Join Sheets ===== | + | ====== Join Sheets |
Use '' | Use '' | ||
Line 64: | Line 88: | ||
| gu | Next time you're in Sheets, " | | gu | Next time you're in Sheets, " | ||
- | === Protip: Remove insignificant noisy rows === | + | ==== Protip: Remove insignificant noisy rows ==== |
^ Key ^ Meaning ^ | ^ Key ^ Meaning ^ | ||
Line 76: | Line 100: | ||
Do a similar thing to join the Joined Sheet with the product Name sheet. | Do a similar thing to join the Joined Sheet with the product Name sheet. | ||
- | ===== Process Data ===== | + | ====== Process Data ====== |
In your final Joined sheet, add rows that are calculations of other rows. | In your final Joined sheet, add rows that are calculations of other rows. | ||
Line 104: | Line 128: | ||
$ vd --play=my_cmdlog.vd --replay-wait=0.5 | $ vd --play=my_cmdlog.vd --replay-wait=0.5 | ||
+ | ====== Lists in Cells for Frequency Tables ====== | ||
+ | Sometimes you want one of the columns in a Frequency Table to be a list of unique values. Let's say the column title is " | ||
+ | |||
+ | ^ Key ^ Meaning ^ | ||
+ | | + | Set the aggregator to " | ||
+ | | F | Make a Frequency Table for the selected column. (gF for selected columns) | | ||
+ | | =, ',' |
vd.1623914294.txt.gz · Last modified: 2023/04/12 20:44 (external edit)