Even if you can use the entire initial dataset to fit a single model, it turns
out that ensemble methods, where you fit multiple smaller models by using
subsets of data, generally outperform single models. Indeed, fitting a single
model with 100M data points can perform worse than fitting just a few
models with 10M data points each (so smaller total data outperforms larger
total data).