Cheat Sheet Data Wrangling
Cheat Sheet Data Wrangling
database. The JDBC driver must be added the database and exports the Expands the input SQL query to include the join date&time and appends their values in
DB Joiner
configuration window. Grouping of multiple specified by the input connection port.
in the KNIME Preferences and then results into a KNIME data table. of two tables. It has a similar configuration new columns.
conditions with an AND or OR conjunctions If the database table does not exist it
selected in the node configuration window. window as the joiner node. No SQL coding
is also supported. No SQL coding required. will be created.
H2 Connector required. There are more DB nodes, all expanding DB Connection
DB Table Selector Creates a SQL query to access the database the input SQL query with additional SQL DB Query
Connects to an H2 database. Similar
CLEAN DATA
Table Writer
table selected in the configuration window. The Modifies the input SQL query using Writes the resulting rows from the
dedicated connector nodes connect to other instructions. Besides the SQL Query node, no DB custom SQL. The input SQL query is
table can be selected either via browsing the nodes require SQL coding. input SQL query into a new table inside
databases, such as MySQL or PostgreSQL. represented by the place holder
database metadata or via a custom SQL query. the database. Missing Value
#table#. Defines and applies a strategy to
replace missing values in the input
table - either globally on all columns, or
individually for each single column.
RESHAPE AND AGGREGATE DATA DYNAMIC PORT Duplicate
Row Filter Detects duplicate rows and applies the
Concatenate
selected operation, e.g. removes
Dynamic ports: Additional input ports can be duplicate rows. Duplicates are rows
Groups the rows of a table by the added by clicking the three dots in the bottom left that have the same value in all selected
unique values in selected Combine Filter Aggregate Write corner of a node. columns.
GroupBy
columns and calculates
Numeric Outliers
aggregation and statistical Detects and treats numerical outliers for
measures for the defined groups. each of the selected columns
Despite its simple name, it offers individually using the interquartile range
powerful functionality and has
many unsuspected usages.
FORMAT EXCEL SHEETS (IQR).
newly created columns are set to Unpivoting Stacks the cells of the selected value
Adds formatting instructions to merge all cells with a
KNIME Hub: Browse and share workflows,
Math Formula String Manipulation
0 if the value is not present, columns into one column. The cells Implements a number of math operations Performs operations on string values in nodes, and components. Add ratings, or
specified tag in the XLS control table at the top input port.
otherwise 1. This type of encoding of the selected retained columns are across multiple input columns. The math columns, such as combining two or more comments to other workflows at
S
is called one-hot vector. appended to the corresponding operations can be applied to multiple columns strings together, extracting one or more hub.knime.com
Transpose output rows. with the Math Formula (Multi Column) node. substrings, trimming blank spaces, and so on. XLS Conditional
Formatter
Converts the rows to columns and Sorter
Rule Engine String Replacer Adds formatting instructions to color cell backgrounds KNIME Forum: Join our global community &
the columns to rows. Sorts the table in ascending or according to their numeric value for all cells specified by a
Applies a set of rules to each row of the engage in conversations at
descending order based on the Replaces values in a selected string column if tag in the XLS control table at the top
input table. Rules are applied from top to forum.knime.com
values of one or more columns. they match a defined pattern.
bottom. The first rule that matches is used.
XLS Formatter
Performs several transformations (apply) KNIME Server: For team-based collaboration,
Table Manipulator Value Counter
at once, such as renaming, filtering, Counter Generation Column Expressions
Combines the functionality of the Math automation, management, & deployment
Counts the number of occurrences of
re-ordering and type changing, on Creates a new column with a counter. The Formula, Rule Engine, and String Applies all formatting instructions to an existing Excel sheet. check out KNIME Server at
all values in a selected column from
the input columns. By adding start value and step size are defined in the Manipulation nodes. More than one
dynamic ports it can replace a
the input table. www.knime.com/knime-server
configuration window. expression can be defined to modify or add
concatenate node. multiple columns at the same time.