Once you have a data set, you can find a suitable model or function using regression. The range between the lowest and highest observed -value is called the validity range of the model. Within this area, as long as you have chosen a suitable model or function, you can be confident that your results correspond relatively well with reality. But if you look at values outside this range, the results might deviate a lot from reality.
You can use the model you have chosen to calculate the function value (-value) of points other than the specific data points you were given.
Theory
When you find the function value of a point that is within the validity range of your model, you perform an interpolation. Selecting points within the domain of the model will give you the most reliable results.
Theory
When you find the function value of a point that is outside the validity range, you’re performing an extrapolation. This result can deviate significantly from the real values.
Example 1
Jay-Z weighed when he was born. The table below shows his weight, kg, days after his birth.
Age ( days) | Weight ( kg) |
0 | |
7 | |
20 | |
30 | |
57 | |
Use regression to find a linear model for Jay Z’s weight from the data in the table.
From the table you can see that the validity range of the model is determined by the values in the interval . You use digital tools to find the linear model for Jay Z’s weight days after his birth. The model is the function
Based on the model you found, how much did Jay Z weigh after 14 days?
You insert for in the function and find Jay Z’s weight after days:
How much did Jay Z weigh after one year (365 days), based on the model you found?
You insert for and calculate Jay Z’s weight after days:
Because a one-year-old normally weighs between kg and kg, Jay Z was either extremely overweight, or your result is so different due to the value you used is outside the validity range of your data set.
It is more likely that the model’s accuracy is poor outside the validity range, rather than that Jay Z was extremely overweight.
Because you selected a value outside the range (), you performed an extrapolation.