SUSE Hack Week: Implement text based OCR in openQA

Project Description

Currently, openQA requires a stored reference image for OCR-based comparisons. It is not possible to pass openQA a character string to be compared with the text in a screenshot. This project aims to allow storing only character strings in the needle's JSON file, so that OCR needles no longer need reference images.
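As an illustration of the idea, here is a minimal Python sketch of a needle that carries only a text string for its OCR area. The layout is loosely modelled on needle JSON files; the field name `refstr`, the tag, and the coordinates are hypothetical and are not taken from the actual implementation.

```python
import json

# Hypothetical needle: the OCR area stores the expected text directly
# instead of pointing at a reference image. "refstr" is an assumed name.
NEEDLE_JSON = """
{
  "tags": ["example-install-entry"],
  "area": [
    {"xpos": 100, "ypos": 200, "width": 300, "height": 40,
     "type": "ocr", "refstr": "Installation"}
  ]
}
"""


def ocr_reference_strings(needle):
    """Collect the expected text of every OCR area in a needle dict."""
    strings = []
    for area in needle.get("area", []):
        if area.get("type") != "ocr":
            continue  # image-based areas are not handled in this sketch
        text = area.get("refstr")
        if not text:
            raise ValueError("OCR area without a reference string")
        strings.append(text)
    return strings


needle = json.loads(NEEDLE_JSON)
print(ocr_reference_strings(needle))
```

With such a layout, an OCR-only needle would not need an accompanying PNG at all; only the JSON file would be stored.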

Status

Possible tools have been researched. The result is that the current implementation based on Tesseract appears to be too inaccurate on short character strings. The program GOCR seems to use a more classical recognition by shape, which appears to work reasonably accurately on well-shaped characters. The accuracy of the matched strings could be calculated using the library perl-Text-Levenshtein.
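To make the accuracy idea concrete, here is a small Python sketch of an edit-distance based score. It does not use perl-Text-Levenshtein; the function names and the normalisation by the longer string are assumptions chosen for illustration.

```python
def levenshtein(a, b):
    """Edit distance between a and b (insert, delete, substitute cost 1 each)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep when equal)
            ))
        previous = current
    return previous[-1]


def similarity(expected, recognized):
    """Score in [0, 1]; 1.0 means the OCR output equals the expected text."""
    longest = max(len(expected), len(recognized))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(expected, recognized) / longest


# Two misread characters ("ll" recognised as "11") out of seven.
print(similarity("Install", "Insta11"))
```

A needle could then count as matched when the score reaches a configurable threshold, in the same spirit as the match percentage used for image needles.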

Goal for this Hackweek

Create a draft implementation of text-based OCR in os-autoinst.
Optional: Make text-based OCR needles easy to handle in the openQA web frontend (e.g. by providing a live preview of the recognized text)

Resources

This project is tracked here: https://progress.opensuse.org/issues/121354
openQA frontend repo: https://github.com/os-autoinst/openQA
openQA backend repo: https://github.com/os-autoinst/os-autoinst
GOCR: https://wasd.urz.uni-magdeburg.de/jschulen/ocr/
Perl-Text-Levenshtein: https://github.com/neilb/Text-Levenshtein


Looking for hackers with the skills:

openqa mojolicious perl ocr os-autoinst

This project is part of:

Hack Week 22

Activity

almost 3 years ago: okurz liked this project.

almost 3 years ago: jzerebecki liked this project.

almost 3 years ago: pdostal liked this project.

about 3 years ago: mkoutny liked this project.

about 3 years ago: ggardet_arm left this project.

about 3 years ago: ggardet_arm joined this project.

about 3 years ago: ggardet_arm liked this project.

about 3 years ago: robert.richardson liked this project.

about 3 years ago: dancermak liked this project.

about 3 years ago: ybonatakis liked this project.

about 3 years ago: clanig started this project.

about 3 years ago: clanig added keyword "openqa" to this project.

about 3 years ago: clanig added keyword "mojolicious" to this project.

about 3 years ago: clanig added keyword "perl" to this project.

about 3 years ago: clanig added keyword "ocr" to this project.

about 3 years ago: clanig added keyword "os-autoinst" to this project.

about 3 years ago: clanig originated this project.

Comments

almost 3 years ago by okurz

There is very basic support for OCR in os-autoinst in https://github.com/os-autoinst/os-autoinst/blob/master/ocr.pm, which might give you some good ideas and a starting point. https://github.com/os-autoinst/os-autoinst/blob/master/t/02-test_ocr.t shows its usage.

almost 3 years ago by clanig

Created draft PR: https://github.com/os-autoinst/os-autoinst/pull/2276

Similar Projects

openqa

openQA tests needles elaboration using AI image recognition by mdati

Description

In the openQA test framework, the state of a SUT (system under test) is identified from screenshots of its GUI or CLI terminal. The needles framework scans the many images in its repository, each associated with a set of tags (strings) and with selected smaller areas of the image. To manage needles we currently have to store many screenshots, variants of GUI and CLI-terminal images, each one accompanied by a dedicated set of reference data (JSON).

A smarter framework, using image recognition based on AI or other image-processing tools that are now widely available, could improve the matching process and hopefully reduce time and errors during image verification and detection.

Goals

The main aim of this idea is to match a "graphical" image of the console or GUI state of a running openQA test (a screenshot of a shell console or application GUI) using less time and fewer resources, and with fewer errors in data preparation and use, than the current openQA needles framework. That is:

given a SUT (system under test) GUI or CLI-terminal screenshot, with a local distribution of pixels or text commands related to the state of a running test,
we want to identify a desired target, e.g. a screen image state or a data/command context,
- based on AI/ML-pretrained archives containing objects, or on other suitable processing tools,
- possibly also able to identify objects not present in the archive, i.e. by means of AI/ML mechanisms.
The matching result should then be adapted so that the openQA test can continue working with it, in place of the result that the original openQA needles framework would have produced.
We expect a shorter matching time, more reliable results (fewer errors), and simpler archive maintenance when adding or removing objects (a smaller DB and fewer actions).

Hackweek POC:

Main steps

Phase 1 - Plan
- study the available tools
- prepare a plan for the process to build
Phase 2 - Implement
- write and build a draft application
Phase 3 - Data
- prepare the data archive from a subset of needles
- initialize/pre-train the base archive
- select a screenshot from the subset, removing/changing some part
Phase 4 - Test
- run the POC application
- expect the image type to be identified in a good percentage of cases.

Resources

The first step of this project is the identification of useful resources; some possibilities are:

SUSE AI and other ML tools (e.g. TensorFlow)
Tools able to manage images
RPA test tools (e.g. Robot Framework)
others.

Project references

Repository: openqa-needles-AI-driven

openQA log viewer by mpagot

Description

*** Warning: Are You at Risk for VOMIT? ***

Do you find yourself staring at a screen, your eyes glazing over as thousands of lines of text scroll by? Do you feel a wave of text-based nausea when someone asks you to "just check the logs"?

You may be suffering from VOMIT (Verbose Output Mental Irritation Toxicity).

This dangerous, work-induced ailment is triggered by exposure to an overwhelming quantity of log data, especially from parallel systems. The human brain, not designed to mentally process 12 simultaneous autoinst-log.txt files, enters a state of toxic shock. It rejects the "Verbose Output," making it impossible to find the one critical error line buried in a 50,000-line sea of "INFO: doing a thing."

Before you're forced to rm -rf /var/log in a fit of desperation, we present the digital antacid.

No panic: we have The openQA Log Visualizer

This is the UI antidote for handling toxic log environments. It bravely dives into the chaotic, multi-machine mess of your openQA test runs, finds all the related, verbose logs, and force-feeds them into a parser.


Goals

Work on the existing POC openqa-log-visualizer, focusing on a few specific tasks:

add support for more types of logs
extend the configuration file syntax beyond the current one
work on log parsing performance

Find some beta testers and collect feedback and feature ideas

If time allows, evaluate other UI frameworks and solutions (something simpler to distribute and run, maybe more low-level to gain performance).

Resources

openqa-log-visualizer

MCP Perl SDK by kraih

Description

We've been using the MCP Perl SDK to connect openQA with AI. And while the basics are working pretty well, the SDK is not fully spec compliant yet. So let's change that!

Goals

Support for Resources
All response types (Audio, Resource Links, Embedded Resources...)
Tool/Prompt/Resource update notifications
Dynamic Tool/Prompt/Resource lists
New authentication mechanisms

Resources

Bring up Agama based tests for openSUSE Tumbleweed by szarate

Description

Agama has been around for some time already, and we have some tests for it on Tumbleweed. However, they are only in the development job group and are too few to be helpful in assessing the quality of a build.

This project aims at enabling and creating new test suites for the Agama flavor, using the already existing DVD and NET flavors as starting points.

Goals

Introduce tests based on the Agama flavor in the main Tumbleweed job group
Create Tumbleweed YAML schedules for the Agama installer and its own Jsonnet profile (the ones being used now are reused from Leap)
Fan out tests that have long runtimes (i.e. tackle this ticket)
Reduce redundancy in tests

Resources

Tumbleweed development job group:
Tumbleweed main job group in git
osado test repository:

perl

Create a page with all devel:languages:perl packages and their versions by tinita

Description

Perl projects now live in git: https://src.opensuse.org/perl

It would be useful to have an easy way to check which version of which Perl module is in devel:languages:perl. We also have meta overrides and patches for various modules, and it would be good to have them in a central place, so that they are easier to look up and we can share them with other vendors.

I did some initial data dump here a while ago: https://github.com/perlpunk/cpan-meta

But I never had the time to automate this.

I can also use the data to check if there are necessary updates (currently it uses data from download.opensuse.org, so there is some delay and it depends on building).

Goals

Have a script that updates a central repository (e.g. https://src.opensuse.org/perl/_metadata) with metadata by looking at https://src.opensuse.org/perl/_ObsPrj (check if there are any changes from the last run)
Create an HTML page with the list of packages (use JavaScript and some table library to make it easily searchable)

Resources

Results

Day 1

First part of the code which retrieves data from https://src.opensuse.org/perl/_ObsPrj with submodules and creates a YAML and a JSON file.
Repo: https://github.com/perlpunk/opensuse-perl-meta
Also a first version of the HTML is live: https://perlpunk.github.io/opensuse-perl-meta/

Day 2

The HTML page now has links to src.opensuse.org and the date of the last update, plus a short info text at the top
Code is now 100% covered by tests: https://app.codecov.io/gh/perlpunk/opensuse-perl-meta
I used the modern perl class feature, which makes perl classes even nicer and shorter. See example
Tests
- I tried out the mocking feature of the modern Test2::V0 library which provides call tracking. See example
- I tried out comparing data structures with the new Test2::V0 library. It lets you compare parts of the structure with the like function, which only compares the data that is mentioned in the expected data. example

Day 3

Added various things to the table
- Dependencies column
- Show popup with info for cpanspec, patches and dependencies
- Added last date / commit to the data export.

Plan: With the added date / commit we can now daily check _ObsPrj for changes and only fetch the data for changed packages.
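The planned daily check could look roughly like the following Python sketch. It assumes that each run records a mapping from package name to the commit it last saw; the function names, package names, and commit IDs are made up for illustration and do not come from the actual repository.

```python
def changed_packages(previous, current):
    """Packages that are new or whose recorded commit differs from the last run."""
    return sorted(
        name for name, commit in current.items()
        if previous.get(name) != commit
    )


def removed_packages(previous, current):
    """Packages that were present in the last run but are gone now."""
    return sorted(set(previous) - set(current))


# Hypothetical snapshots of package -> commit from two consecutive runs.
last_run = {"perl-Foo": "a1b2c3", "perl-Bar": "d4e5f6", "perl-Old": "000111"}
this_run = {"perl-Foo": "a1b2c3", "perl-Bar": "777888", "perl-New": "999aaa"}

print(changed_packages(last_run, this_run))   # only these need a fresh fetch
print(removed_packages(last_run, this_run))   # these can be dropped from the export
```

Only the packages returned by the first function would need their metadata fetched again, which keeps a daily run cheap when little has changed.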

Day 4
