Learning with not Enough Data Part 3: Data Generation

Lilian Weng·Lilian Weng·AI·April 15, 2022

Here comes the Part 3 on learning with not enough data (Previous: Part 1 and Part 2). Let’s consider two approaches for generating synthetic data for training. Augmented data. Given a set of existing training samples, we can apply a variety of augmentation, distortion and transformation to derive new data points without losing the key attributes. We have covered a bunch of augmentation methods on text and images in a previous post on contrastive learning. For the sake of post completeness, I dup...

Read full article →

Learning with not Enough Data Part 3: Data Generation

Related Articles