Profiling Routines
The following table shows profiling routines provided by NLCPy.
Profiling with FTRACE
Enables profiling with an ftrace region during 'with' statement. |
|
Begins an ftrace region. |
|
Ends an ftrace region. |
FTRACE is a performance analysis tool, which can obtain performance information such as the exclusive time and vectorization aspect on the VE. Please note that VE offloading overhead is included in performance information. For details of FTRACE, see PROGINF/FTRACE User’s Guide.
Note
VE functions in NLCPy are multithreaded. However, the ftrace.out includes performance information of only a master thread. To obtain performance information of all threads, it is necessary to compile C programs of NLCPy by using ncc with the option ‘-ftrace’. If needed, please download source programs of NLCPy from GitHub.
Start and Stop Profiling
Starts profiling. |
|
Stops profiling. |
Get Result
Gets dict of NLCPy run stats. |
|
Prints NLCPy run stats. |