Abstract
Testing probabilistic programs is non-trivial due to their stochastic nature. Given an input, the program may produce different outcomes depending on the underlying stochastic choices in the program. This means testing the expected outcomes of probabilistic programs requires repeated test executions unlike deterministic programs where a single execution may suffice for each test input. This raises the following question: how many times should we run a probabilistic program to effectively test it? This work proposes a novel black-box unit testing method, ProbTest, for testing the outcomes of probabilistic programs. Our method is founded on the theory surrounding a well-known combinatorial problem, the coupon collector’s problem. Using this method, developers can write unit tests as usual without extra effort while the number of required test executions is determined automatically with statistical guarantees for the results. We implement ProbTest as a plug-in for PyTest, a well-known unit testing tool for python programs. Using this plug-in, developers can write unit tests similar to any other Python program and the necessary test executions are handled automatically. We evaluate the method on case studies from the Gymnasium reinforcement learning library and a randomized data structure.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 23rd International Conference on Software Engineering and Formal Methods (SEFM 2025) |
| Number of pages | 19 |
| Volume | 16192 |
| Publisher | Springer Nature Switzerland |
| Publication date | 12 Nov 2025 |
| Pages | 91-109 |
| ISBN (Print) | 978-3-032-10443-4 |
| ISBN (Electronic) | 978-3-032-10444-1 |
| DOIs | |
| Publication status | Published - 12 Nov 2025 |
| Event | Software Engineering and Formal Methods - Spain, Toledo, Spain Duration: 12 Nov 2025 → 14 Nov 2025 Conference number: 23 https://sefm-conference.github.io/2025/ |
Conference
| Conference | Software Engineering and Formal Methods |
|---|---|
| Number | 23 |
| Location | Spain |
| Country/Territory | Spain |
| City | Toledo |
| Period | 12/11/2025 → 14/11/2025 |
| Internet address |
| Series | LNCS |
|---|---|
| Volume | 16192 |
Keywords
- Probabilistic programming
- Black-box unit testing
- Coupon collector problem
- Statistical guarantees
- PyTest plugin