Performance Profiling

From Bohemia Interactive Community
Revision as of 16:34, 19 November 2024 by .kju (talk | contribs) (changed build provider)
Jump to navigation Jump to search

Versions with performance profiling

The Profiling build is provided by Dedmen in tandem with Arma 3 Performance build. It can be found in the BI Forums as well as in the #perf_prof_branch channel in the official Arma 3 Discord server.

  • Through the nature of a Profiling build they are not as performant as the normal release builds!
  • This work also with Arma 3: Diagnostics Exe.


diag_captureFrame sample output
diag_captureFrame sample output
diag_captureFrame sample output


FPS basics

The duration the engine needs to compute all calculations in one cycle is called a frame. The frame-rate, or frames-per-second (fps), in return states how many cycles the engine could compute per second.

The shorter a cycle/frame, the higher the fps.
Note: One second consists of one thousand milliseconds. 1 s = 1000 ms
FPS to frame duration table
FPS seconds milliseconds
100 0.010 10
60 0.017 17
50 0.020 20
40 0.025 25
30 0.033 33
20 0.050 50
10 0.100 100
5 0.200 200
4 0.250 250
3 0.333 333
2 0.500 500
1 1.0 1000


Duration parameter

  • The duration parameter is a time measured in seconds. 0.3 (seconds) is the same as 300 ms / milliseconds.
  • It is used to determine what duration of a frame you consider abnormal or that you are interested in for other reasons.
  • The first time such frame happens, the frame information will be captured - or put differently: when a situation occurs with a frame lasting longer than said duration.
  • 0.3 (<3 fps for the whole frame) is definitely something you should not see in a normal game.
  • If you do not capture any frames with 0.3, try lowering it to 0.2 (200 ms / <5 fps) or 0.1 (100 ms / <10 fps).
  • If it triggers "too early" before the main slowdown happens that you want to analyze, increase it to a higher value, e.g. 1.0 for long freeze (<1 fps).


Scope parameter

  • total is name for the scope of the duration for the whole frame (sLoop on a dedicated server).
  • With the total scope you capture more or less your frames per second (fps) - if your current frame-rate is essentially stable. In other words if all frames for that second have roughly similar duration.
  • In some situations you may encounter single/few frame lasting mini-freezes - aka one/few frames take rather long to complete (ie 100 ms), yet the rest remains in your normal fps range (20 ms for 50 fps for example).
  • Whereas a longer freeze means usually a frame last very long, like a second or longer. Thus the game stalls altogether for the given timeframe.
  • The scope parameter now allows you to capture subparts of a frame, of different parts of the engine computation - in general terms like networking, world and unit simulation, AI behavior, sound, rendering and visibility checks.
  • The total scope is also the mainframe - all things the engine has to complete in a cycle. In addition there are other threads, where the engine moves other tasks to be computed alongside. Essentially this is what multi-threading is about, allowing the CPU with multiple cores to compute more things at the same time, and thus to reduce the duration of the main frame.
  • With the scope parameter you can said subparts from the mainframe, yet also from the so called "worker" threads.
  • The names for scopes/subscopes you can get from the capture frame UI or in the frame capture logging files (.log/.trace).


How to capture

Client

  1. Run a mission
  2. Execute a scripted command diag_captureSlowFrame ["total", 0.3];// [scope, duration] using any means (Debug Console, mission radio trigger...)
  3. Once a slow frame is detected, a window will open
  4. In the window you will be able to browse a lot of performance-related data, which can be interesting
  5. To export the gathered information for sharing with others:
    1. Select Main Thread (if not selected yet)
    2. Press the Copy button
    3. Open an external text editor
    4. Paste the text into a new file
    5. Save the file


Server

How to view a capture

Ingame

Externally

Scripting Commands


diag_captureFrame sample output with custom subtree


Creating Your Own Subtree

When Profiling Per-Frame Eventhandlers (PFH), diag_captureFrame only shows one blob called siFEH that contains all PFH's so you can't see what part of that is caused by your PFH.
You can create your own subtree inside siFEH by wrapping your function call inside a isNil CODE statement like this:
Turn your old call which may look like this:

addMissionEventHandler ["EachFrame", { call myPFHFunction }];

Into something like this:

addMissionEventHandler ["EachFrame", { isNil { call myPFHFunction } // isNil creates the subtree }];

Now when you run diag_captureFrame inside of siPFH you will have a subtree called gsEva and behind that you can see the first line of code inside the isNil statement.
It will only show a part of the first line of that code so you should put something descriptive into the isNil statement.
You can use the same to create a subtree for any function you like. This will also work inside Scheduled (spawned) scripts.
But using this method to "subtree" a function with return values requires a little bit of trickery to get the return value out.


See Also