Update time management of python test generation #1893

tamarinvs19 · 2023-03-03T20:43:37Z

Description

The new test generation strategy is:

If the function is fully annotated and the type inference algorithm cannot infer any types, we spend all the time from the UI settings on fuzzing and test generation;
If type inference can infer types, we fuzz tests for one annotation at a time, with the following restrictions:
- max 150 successfully executed tests (with or without an exception)
- max 10 invalid tests (when we generate incorrect arguments and get a TypeError or AttributeError, or when we cannot generate any valid representative of the inferred type)
- and 5 additional tests (as a precaution)
- until the time limit
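The limits above can be sketched as a small budget tracker. This is a minimal illustration, not the code from this PR: the class name, the field names and the `timeout_only` flag are hypothetical, and since the description does not say how the 5 additional tests are consumed, the sketch simply keeps them as a separate counter.

```python
import time
from dataclasses import dataclass
from typing import Optional


@dataclass
class LimitManager:
    """Hypothetical budget tracker; all names here are illustrative."""
    deadline: float              # absolute time.monotonic() value
    timeout_only: bool = False   # True: ignore the counters, use only the deadline
    executions_left: int = 150   # tests that executed (with or without an exception)
    invalid_left: int = 10       # tests rejected with TypeError or AttributeError
    additional_left: int = 5     # precautionary reserve (its usage is an assumption)

    def add_execution(self) -> None:
        self.executions_left -= 1

    def add_invalid(self) -> None:
        self.invalid_left -= 1

    def add_additional(self) -> None:
        self.additional_left -= 1

    def is_exhausted(self, now: Optional[float] = None) -> bool:
        now = time.monotonic() if now is None else now
        if now >= self.deadline:
            return True          # the time limit always applies
        if self.timeout_only:
            return False         # fully annotated case: only time matters
        # Otherwise stop as soon as any one of the budgets runs out.
        return min(self.executions_left, self.invalid_left, self.additional_left) <= 0
```

With `timeout_only=True` the tracker behaves like the fully annotated case: fuzzing continues until the deadline regardless of how many tests were produced.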

In the future we can modify the strategies and expose them in the UI, for example a strategy that waits for maximum coverage.

Fixes #1874
Fixes #1831

How to test

Manual tests

Fully annotated function

For example,

def div(a: int, b: int):
    return a / b

Expected: two tests (one valid and one with ZeroDivisionError); the test generation process lasts for all the time set in the UI settings.
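As a rough illustration of the two expected outcomes, the checks below show one valid call and one call that raises. The test function names are hypothetical, and this is a hand-written sketch, not actual generator output.

```python
def div(a: int, b: int):
    return a / b


def test_div_valid():
    # A regular execution: true division returns a float.
    assert div(1, 2) == 0.5


def test_div_zero_division():
    # The second expected test: the call raises ZeroDivisionError.
    try:
        div(1, 0)
    except ZeroDivisionError:
        return
    raise AssertionError("expected ZeroDivisionError")
```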

Or,

def str_test(x: str):
    if x[0] == 'b':
        if x[1] == 'a':
            if x[2] == 'd':
                return 'Very bad!'

Expected: generation uses all the available time; with a 60 s timeout there is one test with x = 'ba' or longer, and with a 180 s timeout all branches are covered.
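To see why deeper branches need more fuzzing time, it helps to check how far a few inputs get. The helper `outcome` below is hypothetical and only wraps the function from the example; each nested condition needs one more matching character, and a string that is too short raises IndexError instead.

```python
def str_test(x: str):
    if x[0] == 'b':
        if x[1] == 'a':
            if x[2] == 'd':
                return 'Very bad!'


def outcome(x: str):
    """Return the result of str_test(x), or the name of the exception it raises."""
    try:
        return str_test(x)
    except IndexError as error:
        return type(error).__name__


# Print how far each sample input gets through the nested conditions.
for value in ['', 'x', 'b', 'ba', 'bax', 'bad']:
    print(repr(value), '->', outcome(value))
```

Only 'bad' (or a longer string starting with it) reaches the return statement, so covering every branch requires the fuzzer to find a three-character prefix.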

Function without annotations or with partial annotations

For example,

def str_test(x):
    if x[0] == 'b':
        if x[1] == 'a':
            if x[2] == 'd':
                return 'Very bad!'

Expected: generation uses all the available time; one test with x = 'b' and none with 'ba' or longer (regardless of timeout).

Self-check list

Check off the item if the statement is true. Hint: [x] is a marked item.

Please do not delete the list or its items.

I've set the proper labels for my PR (at least, for category and component).
PR title and description are clear and intelligible.
I've added enough comments to my code, particularly in hard-to-understand areas.
The functionality I've repaired, changed or added is covered with automated tests.
Manual tests have been provided optionally.
The documentation for the functionality I've been working on is up-to-date.

tochilinak · 2023-03-06T11:52:08Z

utbot-python/src/main/kotlin/org/utbot/python/PythonTestCaseGenerator.kt

                return@breaking
            }

-            algo.run(hintCollector.result, isCancelled, annotationHandler)
+            algo.run(hintCollector.result, typeInferenceCancellation, annotationHandler)

            val existsAnnotation = method.definition.type
            if (existsAnnotation.arguments.all { it.pythonTypeName() != "typing.Any" }) {


Seems like this is wrong. Example:

def f(x: list): pass

In this case we need type inference for the internal type.

Inside the type inference algorithm we can find out whether the function signature contains any Any's. If it does not, the initial state has no Any nodes, and the main loop makes only one iteration (initial state -> no new states -> finish).
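The point about `def f(x: list)` can be sketched in plain Python: a bare `list` has an implicit `Any` element type, so a check that only compares the top-level name against `typing.Any` misses it. The helpers `has_any` and `needs_inference` are hypothetical and are not part of the UTBot code base; the sketch assumes Python 3.9+ for `list[int]`.

```python
import inspect
import typing
from typing import Any

# Bare builtin generics: their element types are implicitly Any.
BARE_GENERICS = (list, dict, set, frozenset, tuple)


def has_any(tp) -> bool:
    """True if the annotation is Any or contains Any somewhere inside."""
    if tp is Any:
        return True
    if tp in BARE_GENERICS:
        return True
    return any(has_any(arg) for arg in typing.get_args(tp))


def needs_inference(func) -> bool:
    """True if some parameter is unannotated or its annotation contains Any."""
    params = inspect.signature(func).parameters.values()
    return any(
        p.annotation is inspect.Parameter.empty or has_any(p.annotation)
        for p in params
    )
```

Under this check, `def f(x: list): pass` still needs inference, while `def h(x: list[int]): pass` does not.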

tochilinak

Problem with setting limitManager.mode = TimeoutMode.

tamarinvs19 · 2023-03-06T12:46:38Z

Problem with setting limitManager.mode = TimeoutMode.

Fixed

tochilinak · 2023-03-06T13:03:18Z

utbot-python/src/main/kotlin/org/utbot/python/PythonTestCaseGenerator.kt

-            if (existsAnnotation.arguments.all { it.pythonTypeName() != "typing.Any" }) {
+            if (iterationNumber == 1) {
+                limitManager.mode = TimeoutMode
+                val existsAnnotation = method.definition.type
                annotationHandler(existsAnnotation)


Initially this code was needed to run fuzzing with initial types (even when those are not full). Will we run fuzzing with list[Any] for the following function?

def f(x: list): ...

Maybe. But only if all arguments have annotations other than typing.Any. And if all annotations are full, we run fuzzing twice.

Are you sure? Type inference does not run annotationHandler on the initial signature.

Now I run fuzzing with the initial annotations only after type inference. I could change the program to run fuzzing twice with the initial annotations, but I don't think that is a good solution.

I added handling of the initial annotation after the first expansion in BaselineAlgorithm:

- if the annotation is full, we cannot expand it, so we stop type inference and start fuzzing;
- if the annotation is not full, we try to expand it, but first we fuzz the initial annotation (for example, for list[typing.Any] we can generate an empty list), and after that we continue type inference.
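The two cases can be sketched with a toy model in which an annotation is a string and expansion replaces the first `Any` with a candidate type. Everything here (`CANDIDATES`, `expand`, `run_inference`) is hypothetical and much simpler than BaselineAlgorithm; it only shows the order in which annotations would reach the fuzzer.

```python
from collections import deque

CANDIDATES = ["int", "str"]  # hypothetical pool of concrete types


def is_full(annotation: str) -> bool:
    return "Any" not in annotation


def expand(annotation: str) -> list:
    """Replace the first Any with each candidate; a full annotation has no expansions."""
    if is_full(annotation):
        return []
    return [annotation.replace("Any", candidate, 1) for candidate in CANDIDATES]


def run_inference(initial: str, handler) -> None:
    # The initial annotation is handed to the fuzzer once, before any expansion.
    handler(initial)
    queue = deque(expand(initial))
    while queue:
        state = queue.popleft()
        if is_full(state):
            handler(state)                # a concrete annotation: fuzz it
        else:
            queue.extend(expand(state))   # keep replacing the remaining Any's
```

For a full annotation such as "int" the handler is called exactly once and the loop body never runs; for "list[Any]" the handler sees "list[Any]" first and the expanded annotations afterwards.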

Markoutte · 2023-03-07T05:22:42Z

utbot-python/src/main/kotlin/org/utbot/python/PythonEngine.kt

@@ -195,7 +197,6 @@ class PythonEngine(

                        is PythonEvaluationSuccess -> {
                            val coveredInstructions = evaluationResult.coverage.coveredInstructions
-                            coveredInstructions.forEach { coveredLines.add(it.lineNumber) }

                            val summary = arguments
                                .zip(methodUnderTest.arguments)


On line 215 (sorry for commenting on a different line; I could not find a way to comment on line 215) there is no check of whether

is ValidExecution -> {
    val trieNode: Trie.Node<Instruction> = description.tracer.add(coveredInstructions)

returns a node that was already found. You can check trieNode.count > 1 to ignore duplicates and minimise the number of tests shown to the user. At the moment, similar tests are generated with different arguments but the same trace.
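The duplicate check can be illustrated with a toy trie keyed by the executed instruction sequence. This is a self-contained sketch with hypothetical class names, not the Trie used by the fuzzing module; it only mirrors the idea that adding a trace returns a node whose count tells whether the same trace was seen before.

```python
class TrieNode:
    def __init__(self):
        self.children = {}
        self.count = 0  # how many traces ended exactly at this node


class TraceTrie:
    def __init__(self):
        self.root = TrieNode()

    def add(self, trace):
        """Insert a trace (a sequence of hashable items) and return its final node."""
        node = self.root
        for instruction in trace:
            node = node.children.setdefault(instruction, TrieNode())
        node.count += 1
        return node


def is_duplicate(trie: TraceTrie, trace) -> bool:
    # A count above 1 means an earlier execution followed the same trace.
    return trie.add(trace).count > 1
```

A generator using this check would keep a test only when `is_duplicate` returns False, so two argument sets that follow the same path produce a single test.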

Refactor time management

b002d99

tamarinvs19 added the ctg-refactoring, comp-infrastructure, comp-fuzzing and lang-python labels Mar 3, 2023

tamarinvs19 requested review from denis-fokin, tyuldashev and tochilinak March 3, 2023 20:43

tamarinvs19 self-assigned this Mar 3, 2023

tochilinak reviewed Mar 6, 2023

tochilinak requested changes Mar 6, 2023

tamarinvs19 added 2 commits March 6, 2023 15:45

Add iteration counter to baseline algorithm

dd863d1

Merge branch 'main' into tamarinvs19/timeout-generation

f56ffaf

tochilinak reviewed Mar 6, 2023

Add initial annotation handling

b0cbb92

tochilinak approved these changes Mar 6, 2023

Markoutte requested review from Markoutte and removed request for denis-fokin March 7, 2023 05:17

Markoutte requested changes Mar 7, 2023

Update fuzzing process: add duplicate ignoring, fix feedback returning

310c71b

Markoutte approved these changes Mar 9, 2023

tamarinvs19 merged commit 9c898bb into main Mar 9, 2023

tamarinvs19 deleted the tamarinvs19/timeout-generation branch March 9, 2023 06:59

tamarinvs19 mentioned this pull request Mar 9, 2023

Fix python fuzzing strategy #1914

Merged

6 tasks

alisevych added this to the 2023.03 Release milestone Mar 21, 2023
