Checkout
Personalized AI apps
Build multi-agent systems without code and automate document search, RAG and content generation
Start free trial
Question

Training Data - How do you know you have enough training data?

Answer

Determining the sufficiency of a dataset typically involves the application of the 10 times rule. This principle suggests that the quantity of input data, that is, the number of instances, should be at least tenfold the count of a model's degrees of freedom.