⚡️ Speed up function _gridmake2 by 524%
#1001
Open
📄 524% (5.24x) speedup for _gridmake2 in code_to_optimize/discrete_riccati.py
⏱️ Runtime: 3.50 milliseconds → 561 microseconds (best of 82 runs)
📝 Explanation and details
Optimization Explanation:
The original implementation uses np.tile, np.repeat, and np.column_stack, each of which creates an intermediate array and therefore requires its own memory allocation. With Numba's JIT compilation in nopython mode, the output array can instead be pre-allocated once and filled directly by compiled loops. This eliminates the intermediate allocations and accounts for the measured speedup; the benefit should grow with input size.
✅ Correctness verification report:
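As a rough illustration of the technique described in the explanation, and of the kind of equivalence check a correctness report depends on, here is a minimal sketch for the case of two 1-D inputs. It is not the PR's actual diff: the names gridmake2_reference and gridmake2_loops are hypothetical, the sketch assumes both inputs share a dtype, and it falls back to plain Python when Numba is not installed.

```python
import numpy as np

try:
    from numba import njit
except ImportError:
    # Assumption for portability: without Numba, run the loops uncompiled.
    def njit(func):
        return func


def gridmake2_reference(x1, x2):
    """NumPy version for 1-D inputs, built from tile/repeat/column_stack."""
    return np.column_stack(
        [np.tile(x1, x2.shape[0]), np.repeat(x2, x1.shape[0])]
    )


@njit
def gridmake2_loops(x1, x2):
    """Loop version: allocate the output once and fill it in place.

    Hypothetical sketch; assumes x1 and x2 are 1-D with the same dtype.
    """
    n1 = x1.shape[0]
    n2 = x2.shape[0]
    out = np.empty((n1 * n2, 2), dtype=x1.dtype)
    for j in range(n2):          # x2 varies slowest (matches np.repeat)
        for i in range(n1):      # x1 varies fastest (matches np.tile)
            row = j * n1 + i
            out[row, 0] = x1[i]
            out[row, 1] = x2[j]
    return out


x1 = np.array([1.0, 2.0, 3.0])
x2 = np.array([10.0, 20.0])
same = np.array_equal(gridmake2_loops(x1, x2), gridmake2_reference(x1, x2))
```

For these inputs both functions return a 6 x 2 array in which the first column cycles through x1 and the second column holds each x2 value three times in a row, so same is True. The 2-D first-argument case covered by the existing tests is left out of this sketch.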
⚙️ Existing Unit Tests
test_gridmake2.py::TestGridmake2EdgeCases.test_both_empty_arrays
test_gridmake2.py::TestGridmake2EdgeCases.test_empty_arrays_raise_or_return_empty
test_gridmake2.py::TestGridmake2EdgeCases.test_float_dtype_preserved
test_gridmake2.py::TestGridmake2EdgeCases.test_integer_dtype_preserved
test_gridmake2.py::TestGridmake2NotImplemented.test_1d_first_2d_second_raises
test_gridmake2.py::TestGridmake2NotImplemented.test_both_2d_raises
test_gridmake2.py::TestGridmake2With1DArrays.test_basic_two_element_arrays
test_gridmake2.py::TestGridmake2With1DArrays.test_different_length_arrays
test_gridmake2.py::TestGridmake2With1DArrays.test_float_arrays
test_gridmake2.py::TestGridmake2With1DArrays.test_larger_arrays
test_gridmake2.py::TestGridmake2With1DArrays.test_negative_values
test_gridmake2.py::TestGridmake2With1DArrays.test_result_shape
test_gridmake2.py::TestGridmake2With1DArrays.test_single_element_arrays
test_gridmake2.py::TestGridmake2With1DArrays.test_single_element_with_multi_element
test_gridmake2.py::TestGridmake2With2DFirst.test_2d_first_1d_second
test_gridmake2.py::TestGridmake2With2DFirst.test_2d_multiple_columns
test_gridmake2.py::TestGridmake2With2DFirst.test_2d_single_column
🌀 Generated Regression Tests
To edit these changes, git checkout codeflash/optimize-_gridmake2-mjt4c9bf and push.