How to record data for reinforcement learning agent from any game on Linux

Published in Acta Schola Automata Polonica by Filip Nabrdalik * Jan 30, 2020 * 6 min read

Written by Filip Nabrdalik, Piotr Smuda and Piotr Tempczyk

[Image: Polish astronomer Jan Heweliusz observing with one of his telescopes (1647)]

The number of games supporting the OpenAI Gym API is already large and keeps growing. These games let you train reinforcement learning agents with ease. But what if you want to play your favorite game and it has no API suitable for RL? If the game exposes any API at all, your problem is partially solved, but when the game of your choice is meant to be played only with a keyboard, mouse, pad or racing wheel, your problem is much harder. In this blog post you will find out what you can do in this situation. As our example we use F1 2017, which you can get for free from here.

Solution

Recording frames

First of all, we need visual input for our reinforcement learning algorithm. We can achieve this with standard Linux tools, using only the command line. To record a window on the screen we are going to use ffmpeg and the built-in Xorg utilities. To install ffmpeg on Ubuntu, all you need to do is type the following command:

sudo apt install ffmpeg

First, let's have a look at an example ffmpeg command which captures an 800x600 video whose upper left corner is located at position (50,100) on the screen:

ffmpeg -y -video_size 800x600 -framerate 30 -f x11grab -i :0.0+50,100 -c:v libx264 -crf 0 -preset ultrafast f1_recording.mkv

We use the x11grab input device to grab the screen output, then encode it with the x264 codec and save it as an MKV file. The input argument passed to ffmpeg has to have the following format:

-i :DISPLAYID.SCREENID+XPOSITION,YPOSITION

so -i :0.0+50,100 means display 0, screen 0, with the capture region offset by (50,100). Let's see how to get this information for a particular window. First, run the w command and read the display identifier from the FROM column. You can also find it in the DISPLAY environment variable of your shell.

01:07:38 up 5 days, 58 min, 4 users, load average: 0.40, 0.37, 0.41
USER  TTY    FROM  LOGIN@  IDLE    JCPU   PCPU   WHAT
oli   tty7   :0    Sat00   5days   4:22m  0.94s  gnome-session
oli   pts/4  :0    Sat00   47:09m  0.77s  0.77s  /bin/bash
oli   pts/6  :0    Wed02   0.00s   0.12s  0.00s  w

The second step is to execute xwininfo -tree -root and find the window id. Example output:

Root window id: 0x29a (the root window) (has no name)
  Parent window id: 0x0 (none)
  69 children:
  0x2800001 "/usr/lib/firefox/firefox": 10x10+10+10 +10+10
  0x220003c "F1 ": 190x29+487+105 +487+105
  ...

Now we can get detailed information about the window we are interested in by executing xwininfo -id 0x220003c:

Absolute upper-left X: 0
Absolute upper-left Y: 0
Relative upper-left X: 10
Relative upper-left Y: 38
Width: 800
Height: 600
Depth: 24
Visual: 0x21
...

We now have all the information we need. Let's start ffmpeg in a separate terminal:

ffmpeg -y -video_size 800x600 -framerate 30 -f x11grab -i :0.0+50,100 -c:v libx264 -crf 0 -preset ultrafast f1_recording.mkv

After pressing CTRL+C the recording stops and we can play the file with any video player. We can also use a pipe to play the video in VLC or to process it with another tool:

ffmpeg -y -video_size 800x600 -framerate 30 -f x11grab -i :0.0+50,100 -c:v libx264 -crf 0 -preset ultrafast -f mpegts pipe:1 | vlc -

Notice that the container format is different when using a pipe. You can play with the many options ffmpeg provides to achieve different results; have a look at the ffmpeg documentation. You can also check out our repository to see the bash scripts we used during the early stages of our project.

Recording with Python script

We have created a simple Python script which automates the whole process. You can find it here. Just change the APP_NAME variable to record the app you want. The script was written using the ffmpeg-python library, which nicely wraps ffmpeg. The library is quite powerful and you can use it to process your video with numpy or tensorflow.

In the Python code we follow exactly the same steps as described in the previous chapter. First, we get the display id with get_current_display_id(), then we search for the window id of the app using a part of its name with get_window_id(name), and the third step is to get the window position and size with get_window_attrs(window_id). Finally, we pass this information to the ffmpeg-python library and record what we need. In the script we also do a little bit of processing before saving to a file: we crop and re-scale the input from x11grab, so just remove the corresponding lines from the ffmpeg command builder if you don't want that. Also, remember to check out our repository.

[screen-recording.py]
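To give a flavor of what those helpers do, here is a minimal standalone sketch of the window-discovery step. It is not the repository script: the helper name and the regex-based parsing of xwininfo output are our own simplification.

import re
import subprocess

def get_window_geometry(window_id):
    """Return (x, y, width, height) of a window, parsed from xwininfo."""
    out = subprocess.check_output(["xwininfo", "-id", window_id], text=True)
    def field(name):
        # Pull a single integer field like "Width: 800" out of the output.
        return int(re.search(rf"{name}:\s+(-?\d+)", out).group(1))
    return (field("Absolute upper-left X"), field("Absolute upper-left Y"),
            field("Width"), field("Height"))

x, y, w, h = get_window_geometry("0x220003c")
print(f"-video_size {w}x{h} -i :0.0+{x},{y}")  # arguments for the ffmpeg call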
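And here is a minimal sketch of the recording step itself with ffmpeg-python, mirroring the crop-and-rescale processing described above. The crop offsets, the 84x84 target size and the output file name are illustrative assumptions, not values taken from the project:

import ffmpeg  # pip install ffmpeg-python

# Grab the game window region from the X11 display.
stream = ffmpeg.input(":0.0+50,100", f="x11grab",
                      video_size="800x600", framerate=30)
# Optional processing: crop away the window decorations
# (placeholder offsets) and rescale to a small frame for the agent.
stream = ffmpeg.crop(stream, 10, 38, 780, 562)
stream = ffmpeg.filter(stream, "scale", 84, 84)
# Lossless, low-latency encoding, as in the CLI command above.
stream = ffmpeg.output(stream, "f1_recording.mkv",
                       vcodec="libx264", crf=0, preset="ultrafast")
ffmpeg.run(stream, overwrite_output=True)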
Getting in-game data and controller's state

The F1 2017 game lets you extract a lot of game data, the most interesting part being the car's telemetry, which can be obtained over the User Datagram Protocol (UDP). Moreover, using external software it is possible to read the game controller's state (in our case a Logitech G27 wheel), which in combination with the telemetry gives lots of useful information.

To enable telemetry output, navigate to Game Options -> Settings -> UDP Telemetry Settings and turn on the UDP and broadcast options. You should also configure the IP address and port of the receiving application and the send rate per second, which we set to 127.0.0.1, 20778 and 30, respectively. From now on the game will broadcast in-game data during racing, such as the throttle's and steering's state or the race position! You can find more about the available data here.

The only thing left is to receive it in Python. It is worth pointing out that we use the pre-installed socket module for the telemetry data and the external pygame library for the wheel's data, so you may need to install pygame first:

pip install pygame

Now let's look at how it all works. First, we need to initialize a socket and a getter, which let us receive the telemetry and the wheel's data, respectively (for both, the list of data field names has to be provided in the correct order!). For the socket we use the same IP address and port that are set in the game:

SOCKET = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
SOCKET.bind((UDP_IP, UDP_PORT))

and for the getter it looks like this:

pygame.init()
wheel = pygame.joystick.Joystick(wheel_index)
wheel.init()

where wheel_index is the index of the controller in the system (usually 0 if the wheel is the only controller connected to the PC).

In F1 2017, data is broadcast only while you are in control of the car; if the game is paused or you are in a menu, no data is available. So we can capture data in a loop using our socket and getter. Whenever telemetry data arrives, we read it from the buffer and decode it from bytes to numerical values using the struct module:

telemetry_data, addr = SOCKET.recvfrom(1289)
value = struct.unpack(value_type, telemetry_data[x : x + nbytes])[0]

where 1289 is the packet size in bytes, value_type is "f" or "b" for float or byte values respectively, x is the position within the packet and nbytes is the number of bytes the value occupies. At the same time we read the wheel state with the get() method of the WheelGetter class and, combining it with the telemetry data, we write everything into one row of a pandas.DataFrame. Finally, we append the received data as a new line of a CSV file on disk. To find out more about the implementation, please have a look at our repository.

[game_data.py]
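To make the whole flow concrete, here is a minimal sketch of such a capture loop. The packet offsets, field names and the mapping of wheel axes to controls are illustrative assumptions only; the real F1 2017 packet layout is documented by Codemasters and handled properly in our repository.

import socket
import struct

import pandas as pd
import pygame

UDP_IP, UDP_PORT = "127.0.0.1", 20778
PACKET_SIZE = 1289  # size of one F1 2017 telemetry packet in bytes

sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
sock.bind((UDP_IP, UDP_PORT))

pygame.init()
wheel = pygame.joystick.Joystick(0)  # assumes the wheel is joystick 0
wheel.init()

rows = []
for _ in range(1000):  # capture a fixed number of packets for this example
    packet, _addr = sock.recvfrom(PACKET_SIZE)
    # Hypothetical offsets: unpack the first two floats of the packet.
    # Consult the official packet specification for the real layout.
    speed = struct.unpack("f", packet[0:4])[0]
    lap_time = struct.unpack("f", packet[4:8])[0]
    # Poll the wheel: pump the event queue, then read the axis positions.
    pygame.event.pump()
    steer = wheel.get_axis(0)
    throttle = wheel.get_axis(1)
    rows.append({"speed": speed, "lap_time": lap_time,
                 "steer": steer, "throttle": throttle})

pd.DataFrame(rows).to_csv("f1_data.csv", index=False)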
Final remarks

This blog post is one of the results of a bigger project whose goal was to train an RL agent to play F1 2017. Many other people were involved in this project: Maciej Sliwowski, Bartosz Topolski, Tomasz Malisz, Jakub Grudzien and Tomasz Arczewski. During this project we also developed a framework for reinforcement learning named People's Reinforcement Learning (PRL).

If you enjoyed this post, please hit the clap button below and follow our publication for more interesting articles about ML & AI.