time-to-botec/C-optimized/README.md

33 lines
1.3 KiB
Markdown
Raw Normal View History

2023-05-29 21:17:07 +00:00
# C-Optimized
2023-05-29 22:48:25 +00:00
An optimized version of the original C implementation.
The main changes are:
- an optimization of the mixture function (it passes the functions instead of the whole arrays, reducing in great measure the memory usage and the computation time) and
- the implementation of multi-threading with OpenMP.
2023-05-29 21:17:07 +00:00
2023-05-29 23:40:03 +00:00
## Performance
2023-05-29 21:17:07 +00:00
The mean time of execution is 6 ms. With the following distribution:
2023-05-29 22:48:25 +00:00
![Time histogram](https://i.imgur.com/6iT2PkF.png)
2023-05-29 21:17:07 +00:00
2023-05-29 21:30:07 +00:00
The hardware used has been an AMD 5800x3D and 16GB of DDR4-3200 MHz.
2023-05-29 22:48:25 +00:00
Also, the time data has been collected by executing the interior of the main() function 1000 times in a for loop, not executing the program itself 1000 times.
2023-05-29 23:40:03 +00:00
## Multithreading
Take into account that the multi-threading introduces a bit of dispersion in the execution time due to the creation and destruction of threads.
In Nuño's machine, multithreading actually introduces a noticeable slowdown factor.
2023-05-30 22:44:29 +00:00
## To do
2023-06-02 19:56:50 +00:00
- [ ] Use proper profiling tool to capture timing with 1M samples.
2023-05-30 22:44:29 +00:00
- [ ] Update above with correct timing
- [ ] Add Windows/Powershell time-measuring commands
- [ ] Add CUDA?
- [ ] See if program can be reworded so as to use multithreading effectively, e.g., so that you see speed gains proportional to the number of threads used