精读《设计机器学习系统》-ch03: 数据工程基础

这一章大部分内容和DDIA很相似,作者也说
“If you’re already familiar with data systems, you might want to move directly to Chapter 4 to learn more about how to sample and generate labels to create training data. If you want to learn more about data engineering from a systems perspective, I recommend Martin Kleppmann’s excellent book Designing Data-Intensive Applica‐ tions (O’Reilly, 2017).”
所以先跳过



