63 orders of magnitude
  • Home
  • Github
  • Bookmarks
Tagged

aggregate

A collection of 2 posts

Finding latest non-null values in columns
Spark

Finding latest non-null values in columns

Imagine we have a table with a sort of primary key where information is added or updated partially: not all the columns for a key are updated each time, but we now want to have a consolidated view of the information, with just one value of the key containing the

  • Marti Sec
Marti Sec Mar 17, 2019 • 3 min read
Pivoting a table with Spark
Spark

Pivoting a table with Spark

A pivot table is an old friend of Business Intelligence: it allows to summarize the data of another table, usually by performing aggregations (sum, mean, max, ...) on aggregated data. They are found in multiple reporting tools (i.e. Qlik, Tableau and Excel) and analytical RDBMS (i.e. Oracle) also implement

  • Marti Sec
Marti Sec Oct 15, 2017 • 3 min read
63 orders of magnitude © 2021
  • Data & privacy
  • About Me
Powered by Ghost