Reduce memory cost of conjugate gradient descent
Before, I kept five vectors; now just four. The realization is that the previous gradient and the temporary trial point (used for the line search) are never needed at the same time, so they can share memory.
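A minimal sketch of the idea, not the code from this commit: it assumes a nonlinear conjugate gradient loop with a Polak-Ribière update and a simple backtracking line search, neither of which the commit message specifies. The names `minimize_cg`, `grad_into`, and `scratch` are hypothetical. The four vectors are `x`, `g`, `d`, and `scratch`; `scratch` holds the trial point during the line search and the previous gradient afterwards.

```python
def minimize_cg(f, grad_into, x, max_iter=500, tol=1e-10):
    """Nonlinear CG sketch using four length-n vectors (hypothetical API).

    f(x) returns a float; grad_into(x, out) writes the gradient into out.
    x is updated in place and returned.
    """
    n = len(x)
    g = [0.0] * n        # gradient at the current point
    d = [0.0] * n        # search direction
    scratch = [0.0] * n  # shared: trial point, then previous gradient

    grad_into(x, g)
    for i in range(n):
        d[i] = -g[i]
    fx = f(x)

    for _ in range(max_iter):
        gg = sum(gi * gi for gi in g)
        if gg < tol:
            break
        slope = sum(gi * di for gi, di in zip(g, d))
        if slope >= 0.0:
            # Not a descent direction: restart with steepest descent.
            for i in range(n):
                d[i] = -g[i]
            slope = -gg

        # Backtracking line search; scratch holds the trial point.
        alpha, accepted = 1.0, False
        while alpha > 1e-16:
            for i in range(n):
                scratch[i] = x[i] + alpha * d[i]
            f_trial = f(scratch)
            if f_trial <= fx + 1e-4 * alpha * slope:
                accepted = True
                break
            alpha *= 0.5
        if not accepted:
            break

        x[:] = scratch   # accept the trial point (in-place copy)
        fx = f_trial

        # The trial point is no longer needed, so scratch is reused
        # to keep the previous gradient for the beta computation.
        scratch[:] = g
        grad_into(x, g)
        num = sum(gi * (gi - pi) for gi, pi in zip(g, scratch))
        beta = max(0.0, num / gg)  # Polak-Ribiere with non-negative clamp
        for i in range(n):
            d[i] = -g[i] + beta * d[i]
    return x


def example_f(x):
    return (x[0] - 1.0) ** 2 + 10.0 * (x[1] + 2.0) ** 2


def example_grad_into(x, out):
    out[0] = 2.0 * (x[0] - 1.0)
    out[1] = 20.0 * (x[1] + 2.0)
```

Calling `minimize_cg(example_f, example_grad_into, [0.0, 0.0])` should move the point toward the minimizer at (1, -2). The sharing works because the trial point is copied into `x` before the previous gradient has to be saved.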