What is inverse regression?
Inverse regression (also called reciprocal regression) fits a set of paired observations to the model \(y = A + \frac{B}{x}\). This curve is the natural choice when a quantity falls off in proportion to \(\frac{1}{x}\) — for example, when the response is high for small \(x\) and tends toward a constant baseline \(A\) as \(x\) grows large. Because the model is linear in the transformed predictor \(u = \frac{1}{x}\), it can be solved exactly with ordinary least squares. This is a universal mathematics and statistics tool: the formulas are the same everywhere.
How to use it
Enter your data with one point per line in the form x, y (comma or space separated). You need at least two points, and every \(x\) must be non-zero because the model uses \(\frac{1}{x}\). Pick the number of significant digits for the displayed output (this affects formatting only, not the internal computation). The calculator returns the intercept \(A\), the numerator coefficient \(B\), the correlation coefficient \(r\) between \(\frac{1}{x}\) and \(y\), the fully substituted equation, and a sampled fitted curve you can plot against your scatter points.
The formula explained
For each point compute \(u_i = \frac{1}{x_i}\), then run a simple linear regression of \(y\) on \(u\). Using the means \(\bar{u}\) and \(\bar{y}\), form $$S_{uu} = \sum u^2 - n\cdot\bar{u}^2,$$ $$S_{uy} = \sum (u\cdot y) - n\cdot\bar{u}\cdot\bar{y},$$ and $$S_{yy} = \sum y^2 - n\cdot\bar{y}^2.$$ Then the slope is \(B = \frac{S_{uy}}{S_{uu}}\), the intercept is \(A = \bar{y} - B\cdot\bar{u}\), and the correlation is $$r = \frac{S_{uy}}{\sqrt{S_{uu}} \cdot \sqrt{S_{yy}}}.$$
Worked example
For \(x = [1, 2, 3, 4, 5]\) and \(y = [6, 3, 2, 1.5, 1.2]\): \(\bar{u} = 0.456667\), \(\bar{y} = 2.74\), \(S_{uu} = 0.420889\), \(S_{uy} = 2.525333\), \(S_{yy} = 15.152\). So $$B = \frac{2.525333}{0.420889} = 6.000,$$ $$A = 2.74 - 6\cdot 0.456667 = 0.000,$$ and $$r = \frac{2.525333}{0.648759 \cdot 3.892557} = 1.000.$$ The fit is exactly \(y = \frac{6}{x}\), a perfect inverse relationship.
FAQ
Why must \(x\) be non-zero? The model uses \(\frac{1}{x}\), which is undefined at \(x = 0\), so any such row is skipped and reported.
What does \(r\) mean here? It measures how well \(\frac{1}{x}\) predicts \(y\) linearly: \(|r|\) above 0.7 is strong, 0.4–0.7 moderate, 0.2–0.4 weak, below 0.2 no correlation.
When does the fit fail? If all \(x\) values are equal, every \(\frac{1}{x}\) is identical, \(S_{uu} = 0\), and the slope cannot be determined.