Data Normalization

Learned from SE464.

Data Normalization

A normalized relational database has no duplication of data.

  • References (foreign keys) point to shared data
  • In effect, many users are related to each other by all being linked to that industry
  • To optimally partition the rows into shards, we could solve a balanced graph partitioning problem.

SQL uses normalized data. NoSQL uses denormalized data.

#serendipity I try to make my obsidian normalized, but sometimes, it’s just easier to have it denormalized. Places like CS247 or SE464 where I just have all the notes in a centralized location, but also make separate pages for later.