About Social Code
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorLucas Fryzek <lucas.fryzek@gmail.com>2023-02-28 14:55:57 -0500
committerLucas Fryzek <lucas.fryzek@gmail.com>2023-02-28 14:55:57 -0500
commitc4a96d638320cf134e0cc384c76b9a7907ab26f3 (patch)
tree0678c9eb04f37498c32205e13e4777f58d626a1e
parent25b08ce8bbc34c82558b86111f07fb88bf3b19dd (diff)
Add freedreno post
-rw-r--r--html/assets/freedreno/3d-mark.pngbin0 -> 3170073 bytes
-rw-r--r--html/assets/freedreno/glinfo_freedreno.pngbin0 -> 119361 bytes
-rw-r--r--html/assets/freedreno/glinfo_freedreno_preview.pngbin0 -> 55495 bytes
-rw-r--r--html/assets/style.css2
-rw-r--r--html/feed.xml50
-rw-r--r--html/graphics_feed.xml50
-rw-r--r--html/index.html7
-rw-r--r--html/notes/freedreno_journey.html94
-rw-r--r--notes/freedreno_journey.md72
9 files changed, 270 insertions, 5 deletions
diff --git a/html/assets/freedreno/3d-mark.png b/html/assets/freedreno/3d-mark.png
new file mode 100644
index 0000000..7b69df3
--- /dev/null
+++ b/html/assets/freedreno/3d-mark.png
Binary files differ
diff --git a/html/assets/freedreno/glinfo_freedreno.png b/html/assets/freedreno/glinfo_freedreno.png
new file mode 100644
index 0000000..abbb52b
--- /dev/null
+++ b/html/assets/freedreno/glinfo_freedreno.png
Binary files differ
diff --git a/html/assets/freedreno/glinfo_freedreno_preview.png b/html/assets/freedreno/glinfo_freedreno_preview.png
new file mode 100644
index 0000000..ab39307
--- /dev/null
+++ b/html/assets/freedreno/glinfo_freedreno_preview.png
Binary files differ
diff --git a/html/assets/style.css b/html/assets/style.css
index 7b4de1a..d9c40d8 100644
--- a/html/assets/style.css
+++ b/html/assets/style.css
@@ -60,7 +60,7 @@ img
{
display: block;
margin: auto;
- max-width: 100%;
+ max-width: 800px;
height: auto;
}
diff --git a/html/feed.xml b/html/feed.xml
index a526b2c..fd9e5dc 100644
--- a/html/feed.xml
+++ b/html/feed.xml
@@ -1,5 +1,5 @@
<?xml version='1.0' encoding='UTF-8'?>
-<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Fryzek Concepts</title><atom:link href="https://fryzekconcepts.com/feed.xml" rel="self" type="application/rss+xml"/><link>https://fryzekconcepts.com</link><description>Lucas is a developer working on cool things</description><lastBuildDate>Mon, 20 Feb 2023 19:31:26 -0000</lastBuildDate><item><title>Generating Video</title><link>https://fryzekconcepts.com/notes/generating-video.html</link><description>&lt;p&gt;One thing I’m very interested in is computer graphics. This could be complex 3D graphics or simple 2D graphics. The idea of getting a computer to display visual data fascinates me. One fundamental part of showing visual data is interfacing with a computer monitor. This can be accomplished by generating a video signal that the monitor understands. Below I have written instructions on how an FPGA can be used to generate a video signal. I have specifically worked with the iCEBreaker FPGA but the theory contained within this should work with any FPGA or device that you can generate the appropriate timings for.&lt;/p&gt;
+<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Fryzek Concepts</title><atom:link href="https://fryzekconcepts.com/feed.xml" rel="self" type="application/rss+xml"/><link>https://fryzekconcepts.com</link><description>Lucas is a developer working on cool things</description><lastBuildDate>Tue, 28 Feb 2023 19:54:00 -0000</lastBuildDate><item><title>Generating Video</title><link>https://fryzekconcepts.com/notes/generating-video.html</link><description>&lt;p&gt;One thing I’m very interested in is computer graphics. This could be complex 3D graphics or simple 2D graphics. The idea of getting a computer to display visual data fascinates me. One fundamental part of showing visual data is interfacing with a computer monitor. This can be accomplished by generating a video signal that the monitor understands. Below I have written instructions on how an FPGA can be used to generate a video signal. I have specifically worked with the iCEBreaker FPGA but the theory contained within this should work with any FPGA or device that you can generate the appropriate timings for.&lt;/p&gt;
&lt;h3 id="tools"&gt;Tools&lt;/h3&gt;
&lt;p&gt;Hardware used (&lt;a href="https://www.crowdsupply.com/1bitsquared/icebreaker-fpga"&gt;link for board&lt;/a&gt;):&lt;/p&gt;
&lt;ul&gt;
@@ -756,4 +756,50 @@ riscv64-unknown-elf-objcopy -O binary app.elf app.bin&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Another powerful feature of Godot is its physics engine. Godot makes it really easy to create physics objects in your scene and have them do interesting stuff. You might be wondering where physics comes into play in our game, and we actually use it for the root animations. I set up a sort of “rag doll” system for the roots to make them flop around in the air as the player moves, really giving a lot more “life” to an otherwise static game.&lt;/p&gt;
&lt;p&gt;Godot has a built in scripting language called “GDScript” which is very similar to Python. I’ve really grown to like this language. It has an optional type system you can take advantage of that helps with reducing the number of bugs that exist in your game. It also has great connectivity with the editor. This proved useful as I could “export” variables in the game and allow my team members to modify certain parameters of the game without knowing any programming. This is super helpful with balancing, and more easily allows non-technical members of team to contribute to the game logic in a more concrete way.&lt;/p&gt;
&lt;p&gt;Overall I’m very happy with how our game turned out. Last year I tried to participate in a few more game jams, but due to a combination of lack of personal motivation, poor team dynamics, and other factors, none of those game jams panned out. This was the first game jam in a while where I feel like I really connected with my team and I also feel like we made a super polished and fun game in the end.&lt;/p&gt;
-</description><pubDate>Sat, 11 Feb 2023 05:00:00 -0000</pubDate><guid>https://fryzekconcepts.com/notes/global_game_jam_2023.html</guid></item></channel></rss> \ No newline at end of file
+</description><pubDate>Sat, 11 Feb 2023 05:00:00 -0000</pubDate><guid>https://fryzekconcepts.com/notes/global_game_jam_2023.html</guid></item><item><title>Journey Through Freedreno</title><link>https://fryzekconcepts.com/notes/freedreno_journey.html</link><description>&lt;figure&gt;
+&lt;img src="/assets/freedreno/glinfo_freedreno.png" alt="Android running Freedreno" /&gt;&lt;figcaption aria-hidden="true"&gt;Android running Freedreno&lt;/figcaption&gt;
+&lt;/figure&gt;
+&lt;p&gt;As part of my training at Igalia I’ve been attempting to write a new backend for Freedreno that targets the proprietary “KGSL” kernel mode driver. For those unaware there are two “main” kernel mode drivers on Qualcomm SOCs for the GPU, there is the “MSM”, and “KGSL”. “MSM” is DRM compliant, and Freedreno already able to run on this driver. “KGSL” is the proprietary KMD that Qualcomm’s proprietary userspace driver targets. Now why would you want to run freedreno against KGSL, when MSM exists? Well there are a few ones, first MSM only really works on an up-streamed kernel, so if you have to run a down-streamed kernel you can continue using the version of KGSL that the manufacturer shipped with your device. Second this allows you to run both the proprietary adreno driver and the open source freedreno driver on the same device just by swapping libraries, which can be very nice for quickly testing something against both drivers.&lt;/p&gt;
+&lt;h2 id="when-drm-isnt-just-drm"&gt;When “DRM” isn’t just “DRM”&lt;/h2&gt;
+&lt;p&gt;When working on a new backend, one of the critical things to do is to make use of as much “common code” as possible. This has a number of benefits, least of all reducing the amount of code you have to write. It also allows reduces the number of bugs that will likely exist as you are relying on well tested code, and it ensures that the backend is mostly likely going to continue to work with new driver updates.&lt;/p&gt;
+&lt;p&gt;When I started the work for a new backend I looked inside mesa’s &lt;code&gt;src/freedreno/drm&lt;/code&gt; folder. This has the current backend code for Freedreno, and its already modularized to support multiple backends. It currently has support for the above mentioned MSM kernel mode driver as well as virtio (a backend that allows Freedreno to be used from within in a virtualized environment). From the name of this path, you would think that the code in this module would only work with kernel mode drivers that implement DRM, but actually there is only a handful of places in this module where DRM support is assumed. This made it a good starting point to introduce the KGSL backend and piggy back off the common code.&lt;/p&gt;
+&lt;p&gt;For example the &lt;code&gt;drm&lt;/code&gt; module has a lot of code to deal with the management of synchronization primitives, buffer objects, and command submit lists. All managed at a abstraction above “DRM” and to re-implement this code would be a bad idea.&lt;/p&gt;
+&lt;h2 id="how-to-get-android-to-behave"&gt;How to get Android to behave&lt;/h2&gt;
+&lt;p&gt;One of this big struggles with getting the KGSL backend working was figuring out how I could get Android to load mesa instead of Qualcomm blob driver that is shipped with the device image. Thankfully a good chunk of this work has already been figured out when the Turnip developers (Turnip is the open source Vulkan implementation for Adreno GPUs) figured out how to get Turnip running on android with KGSL. Thankfully one of my coworkers &lt;a href="https://blogs.igalia.com/dpiliaiev/"&gt;Danylo&lt;/a&gt; is one of those Turnip developers, and he gave me a lot of guidance on getting Android setup. One thing to watch out for is the outdated instructions &lt;a href="https://docs.mesa3d.org/android.html"&gt;here&lt;/a&gt;. These instructions &lt;em&gt;almost&lt;/em&gt; work, but require some modifications. First if you’re using a more modern version of the Android NDK, the compiler has been replaced with LLVM/Clang, so you need to change which compiler is being used. Second flags like &lt;code&gt;system&lt;/code&gt; in the cross compiler script incorrectly set the system as &lt;code&gt;linux&lt;/code&gt; instead of &lt;code&gt;android&lt;/code&gt;. I had success using the below cross compiler script. Take note that the compiler paths need to be updated to match where you extracted the android NDK on your system.&lt;/p&gt;
+&lt;pre class="meson"&gt;&lt;code&gt;[binaries]
+ar = &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-ar&amp;#39;
+c = [&amp;#39;ccache&amp;#39;, &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang&amp;#39;]
+cpp = [&amp;#39;ccache&amp;#39;, &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang++&amp;#39;, &amp;#39;-fno-exceptions&amp;#39;, &amp;#39;-fno-unwind-tables&amp;#39;, &amp;#39;-fno-asynchronous-unwind-tables&amp;#39;, &amp;#39;-static-libstdc++&amp;#39;]
+c_ld = &amp;#39;lld&amp;#39;
+cpp_ld = &amp;#39;lld&amp;#39;
+strip = &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-strip&amp;#39;
+# Android doesn&amp;#39;t come with a pkg-config, but we need one for Meson to be happy not
+# finding all the optional deps it looks for. Use system pkg-config pointing at a
+# directory we get to populate with any .pc files we want to add for Android
+pkgconfig = [&amp;#39;env&amp;#39;, &amp;#39;PKG_CONFIG_LIBDIR=/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/pkgconfig:/home/lfryzek/Documents/projects/igalia/freedreno/install-android/lib/pkgconfig&amp;#39;, &amp;#39;/usr/bin/pkg-config&amp;#39;]
+
+[host_machine]
+system = &amp;#39;android&amp;#39;
+cpu_family = &amp;#39;arm&amp;#39;
+cpu = &amp;#39;armv8&amp;#39;
+endian = &amp;#39;little&amp;#39;&lt;/code&gt;&lt;/pre&gt;
+&lt;p&gt;Another thing I had to figure out with Android, that was different with these instructions, was how I would get Android to load mesa versions of mesa libraries. That’s when my colleague &lt;a href="https://www.igalia.com/team/mark"&gt;Mark&lt;/a&gt; pointed out to me that Android is open source and I could just check the source code myself. Sure enough you have find the OpenGL driver loader in &lt;a href="https://android.googlesource.com/platform/frameworks/native/+/master/opengl/libs/EGL/Loader.cpp"&gt;Android’s source code&lt;/a&gt;. From this code we can that Android will try to load a few different files based on some settings, and in my case it would try to load 3 different shaded libraries in the &lt;code&gt;/vendor/lib64/egl&lt;/code&gt; folder, &lt;code&gt;libEGL_adreno.so&lt;/code&gt; ,&lt;code&gt;libGLESv1_CM_adreno.so&lt;/code&gt;, and &lt;code&gt;libGLESv2.so&lt;/code&gt;. I could just replace these libraries with the version built from mesa and voilà, you’re now loading a custom driver! This realization that I could just “read the code” was very powerful in debugging some more android specific issues I ran into, like dealing with gralloc.&lt;/p&gt;
+&lt;p&gt;Something cool that the opensource Freedreno &amp;amp; Turnip driver developers figured out was getting android to run test OpenGL applications from the adb shell without building android APKs. If you check out the &lt;a href="https://gitlab.freedesktop.org/freedreno/freedreno"&gt;freedreno repo&lt;/a&gt;, they have an &lt;code&gt;ndk-build.sh&lt;/code&gt; script that can build tests in the &lt;code&gt;tests-*&lt;/code&gt; folder. The nice benefit of this is that it provides an easy way to run simple test cases without worrying about the android window system integration. Another nifty feature about this repo is the &lt;code&gt;libwrap&lt;/code&gt; tool that lets trace the commands being submitted to the GPU.&lt;/p&gt;
+&lt;h2 id="what-even-is-gralloc"&gt;What even is Gralloc?&lt;/h2&gt;
+&lt;p&gt;Gralloc is the graphics memory allocated in Android, and the OS will use it to allocate the surface for “windows”. This means that the memory we want to render the display to is managed by gralloc and not our KGSL backend. This means we have to get all the information about this surface from gralloc, and if you look in &lt;code&gt;src/egl/driver/dri2/platform_android.c&lt;/code&gt; you will see existing code for handing gralloc. You would think “Hey there is no work for me here then”, but you would be wrong. The handle gralloc provides is hardware specific, and the code in &lt;code&gt;platform_android.c&lt;/code&gt; assumes a DRM gralloc implementation. Thankfully the turnip developers had already gone through this struggle and if you look in &lt;code&gt;src/freedreno/vulkan/tu_android.c&lt;/code&gt; you can see they have implemented a separate path when a Qualcomm msm implementation of gralloc is detected. I could copy this detection logic and add a separate path to &lt;code&gt;platform_android.c&lt;/code&gt;.&lt;/p&gt;
+&lt;h2 id="working-with-the-freedreno-community"&gt;Working with the Freedreno community&lt;/h2&gt;
+&lt;p&gt;When working on any project (open-source or otherwise), it’s nice to know that you aren’t working alone. Thankfully the &lt;code&gt;#freedreno&lt;/code&gt; channel on &lt;code&gt;irc.oftc.net&lt;/code&gt; is very active and full of helpful people to answer any questions you may have. While working on the backend, one area I wasn’t really sure how to address was the synchronization code for buffer objects. The backend exposed a function called &lt;code&gt;cpu_prep&lt;/code&gt;, This function was just there to call the DRM implementation of &lt;code&gt;cpu_prep&lt;/code&gt; on the buffer object. I wasn’t exactly sure how to implement this functionality with KGSL since it doesn’t use DRM buffer objects.&lt;/p&gt;
+&lt;p&gt;I ended up reaching out to the IRC channel and Rob Clark on the channel explained to me that he was actually working on moving a lot of the code for &lt;code&gt;cpu_prep&lt;/code&gt; into common code so that a non-drm driver (like the KGSL backend I was working on) would just need to implement that operation as NOP (no operation).&lt;/p&gt;
+&lt;h2 id="dealing-with-bugs-reverse-engineering-the-blob"&gt;Dealing with bugs &amp;amp; reverse engineering the blob&lt;/h2&gt;
+&lt;p&gt;I encountered a few different bugs when implementing the KGSL backend, but most of them consisted of me calling KGSL wrong, or handing synchronization incorrectly. Thankfully since Turnip is already running on KGSL, I could just more carefully compare my code to what Turnip is doing and figure out my logical mistake.&lt;/p&gt;
+&lt;p&gt;Some of the bugs I encountered required the backend interface in Freedreno to be modified to expose per a new per driver implementation of that backend function, instead of just using a common implementation. For example the existing function to map a buffer object into userspace assumed that the same &lt;code&gt;fd&lt;/code&gt; for the device could be used for the buffer object in the &lt;code&gt;mmap&lt;/code&gt; call. This worked fine for any buffer objects we created through KGSL but would not work for buffer objects created from gralloc (remember the above section on surface memory for windows comming from gralloc). To resolve this issue I exposed a new per backend implementation of “map” where I could take a different path if the buffer object came from gralloc.&lt;/p&gt;
+&lt;p&gt;While testing the KGSL backend I did encounter a new bug that seems to effect both my new KGSL backend and the Turnip KGSL backend. The bug is an &lt;code&gt;iommu fault&lt;/code&gt; that occurs when the surface allocated by gralloc does not have a height that is aligned to 4. The blitting engine on a6xx GPUs copies in 16x4 chunks, so if the height is not aligned by 4 the GPU will try to write to pixels that exists outside the allocated memory. This issue only happens with KGSL backends since we import memory from gralloc, and gralloc allocates exactly enough memory for the surface, with no alignment on the height. If running on any other platform, the &lt;code&gt;fdl&lt;/code&gt; (Freedreno Layout) code would be called to compute the minimum required size for a surface which would take into account the alignment requirement for the height. The blob driver Qualcomm didn’t seem to have this problem, even though its getting the exact same buffer from gralloc. So it must be doing something different to handle the none aligned height.&lt;/p&gt;
+&lt;p&gt;Because this issue relied on gralloc, the application needed to running as an Android APK to get a surface from gralloc. The best way to fix this issue would be to figure out what the blob driver is doing and try to replicate this behavior in Freedreno (assuming it isn’t doing something silly like switch to sysmem rendering). Unfortunately it didn’t look like the libwrap library worked to trace an APK.&lt;/p&gt;
+&lt;p&gt;The libwrap library relied on a linux feature known as &lt;code&gt;LD_PRELOAD&lt;/code&gt; to load &lt;code&gt;libwrap.so&lt;/code&gt; when the application starts and replace the system functions like &lt;code&gt;open&lt;/code&gt; and &lt;code&gt;ioctl&lt;/code&gt; with their own implementation that traces what is being submitted to the KGSL kernel mode driver. Thankfully android exposes this &lt;code&gt;LD_PRELOAD&lt;/code&gt; mechanism through its “wrap” interface where you create a propety called &lt;code&gt;wrap.&amp;lt;app-name&amp;gt;&lt;/code&gt; with a value &lt;code&gt;LD_PRELOAD=&amp;lt;path to libwrap.so&amp;gt;&lt;/code&gt;. Android will then load your library like would be done in a normal linux shell. If you tried to do this with libwrap though you find very quickly that you would get corrupted traces. When android launches your APK, it doesn’t only launch your application, there are different threads for different android system related functions and some of them can also use OpenGL. The libwrap library is not designed to handle multiple threads using KGSL at the same time. After discovering this issue I created a &lt;a href="https://gitlab.freedesktop.org/freedreno/freedreno/-/merge_requests/22"&gt;MR&lt;/a&gt; that would store the tracing file handles as TLS (thread local storage) preventing the clobbering of the trace file, and also allowing you to view the traces generated by different threads separately from each other.&lt;/p&gt;
+&lt;p&gt;With this is in hand one could begin investing what the blob driver is doing to handle this unaligned surfaces.&lt;/p&gt;
+&lt;h2 id="whats-next"&gt;What’s next?&lt;/h2&gt;
+&lt;p&gt;Well the next obvious thing to fix is the aligned height issue which is still open. I’ve also worked on upstreaming my changes with this &lt;a href="https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21570"&gt;WIP MR&lt;/a&gt;.&lt;/p&gt;
+&lt;figure&gt;
+&lt;img src="/assets/freedreno/3d-mark.png" alt="Freedreno running 3d-mark" /&gt;&lt;figcaption aria-hidden="true"&gt;Freedreno running 3d-mark&lt;/figcaption&gt;
+&lt;/figure&gt;
+</description><pubDate>Tue, 28 Feb 2023 05:00:00 -0000</pubDate><guid>https://fryzekconcepts.com/notes/freedreno_journey.html</guid></item></channel></rss> \ No newline at end of file
diff --git a/html/graphics_feed.xml b/html/graphics_feed.xml
index 9aa1ca6..b776e9b 100644
--- a/html/graphics_feed.xml
+++ b/html/graphics_feed.xml
@@ -1,5 +1,5 @@
<?xml version='1.0' encoding='UTF-8'?>
-<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Fryzek Concepts</title><atom:link href="https://fryzekconcepts.com/feed.xml" rel="self" type="application/rss+xml"/><link>https://fryzekconcepts.com</link><description>Lucas is a developer working on cool things</description><lastBuildDate>Mon, 20 Feb 2023 19:31:26 -0000</lastBuildDate><item><title>2022 Graphics Team Contributions at Igalia</title><link>https://fryzekconcepts.com/notes/2022_igalia_graphics_team.html</link><description>&lt;p&gt;This year I started a new job working with &lt;a href="https://www.igalia.com/technology/graphics"&gt;Igalia’s Graphics Team&lt;/a&gt;. For those of you who don’t know &lt;a href="https://www.igalia.com/"&gt;Igalia&lt;/a&gt; they are a &lt;a href="https://en.wikipedia.org/wiki/Igalia"&gt;“worker-owned, employee-run cooperative model consultancy focused on open source software”&lt;/a&gt;.&lt;/p&gt;
+<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Fryzek Concepts</title><atom:link href="https://fryzekconcepts.com/feed.xml" rel="self" type="application/rss+xml"/><link>https://fryzekconcepts.com</link><description>Lucas is a developer working on cool things</description><lastBuildDate>Tue, 28 Feb 2023 19:54:01 -0000</lastBuildDate><item><title>2022 Graphics Team Contributions at Igalia</title><link>https://fryzekconcepts.com/notes/2022_igalia_graphics_team.html</link><description>&lt;p&gt;This year I started a new job working with &lt;a href="https://www.igalia.com/technology/graphics"&gt;Igalia’s Graphics Team&lt;/a&gt;. For those of you who don’t know &lt;a href="https://www.igalia.com/"&gt;Igalia&lt;/a&gt; they are a &lt;a href="https://en.wikipedia.org/wiki/Igalia"&gt;“worker-owned, employee-run cooperative model consultancy focused on open source software”&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;As a new member of the team, I thought it would be a great idea to summarize the incredible amount of work the team completed in 2022. If you’re interested keep reading!&lt;/p&gt;
&lt;h2 id="vulkan-1.2-conformance-on-rpi-4"&gt;Vulkan 1.2 Conformance on RPi 4&lt;/h2&gt;
&lt;p&gt;One of the big milestones for the team in 2022 was &lt;a href="https://www.khronos.org/conformance/adopters/conformant-products#submission_694"&gt;achieving Vulkan 1.2 conformance on the Raspberry Pi 4&lt;/a&gt;. The folks over at the Raspberry Pi company wrote a nice &lt;a href="https://www.raspberrypi.com/news/vulkan-update-version-1-2-conformance-for-raspberry-pi-4/"&gt;article&lt;/a&gt; about the achievement. Igalia has been partnering with the Raspberry Pi company to bring build and improve the graphics driver on all versions of the Raspberry Pi.&lt;/p&gt;
@@ -80,4 +80,50 @@
&lt;figure&gt;
&lt;img src="https://www.igalia.com/assets/i/news/XDC-event-banner.jpg" alt="Photo of A Coruña" /&gt;&lt;figcaption aria-hidden="true"&gt;Photo of A Coruña&lt;/figcaption&gt;
&lt;/figure&gt;
-</description><pubDate>Thu, 02 Feb 2023 05:00:00 -0000</pubDate><guid>https://fryzekconcepts.com/notes/2022_igalia_graphics_team.html</guid></item></channel></rss> \ No newline at end of file
+</description><pubDate>Thu, 02 Feb 2023 05:00:00 -0000</pubDate><guid>https://fryzekconcepts.com/notes/2022_igalia_graphics_team.html</guid></item><item><title>Journey Through Freedreno</title><link>https://fryzekconcepts.com/notes/freedreno_journey.html</link><description>&lt;figure&gt;
+&lt;img src="/assets/freedreno/glinfo_freedreno.png" alt="Android running Freedreno" /&gt;&lt;figcaption aria-hidden="true"&gt;Android running Freedreno&lt;/figcaption&gt;
+&lt;/figure&gt;
+&lt;p&gt;As part of my training at Igalia I’ve been attempting to write a new backend for Freedreno that targets the proprietary “KGSL” kernel mode driver. For those unaware there are two “main” kernel mode drivers on Qualcomm SOCs for the GPU, there is the “MSM”, and “KGSL”. “MSM” is DRM compliant, and Freedreno already able to run on this driver. “KGSL” is the proprietary KMD that Qualcomm’s proprietary userspace driver targets. Now why would you want to run freedreno against KGSL, when MSM exists? Well there are a few ones, first MSM only really works on an up-streamed kernel, so if you have to run a down-streamed kernel you can continue using the version of KGSL that the manufacturer shipped with your device. Second this allows you to run both the proprietary adreno driver and the open source freedreno driver on the same device just by swapping libraries, which can be very nice for quickly testing something against both drivers.&lt;/p&gt;
+&lt;h2 id="when-drm-isnt-just-drm"&gt;When “DRM” isn’t just “DRM”&lt;/h2&gt;
+&lt;p&gt;When working on a new backend, one of the critical things to do is to make use of as much “common code” as possible. This has a number of benefits, least of all reducing the amount of code you have to write. It also allows reduces the number of bugs that will likely exist as you are relying on well tested code, and it ensures that the backend is mostly likely going to continue to work with new driver updates.&lt;/p&gt;
+&lt;p&gt;When I started the work for a new backend I looked inside mesa’s &lt;code&gt;src/freedreno/drm&lt;/code&gt; folder. This has the current backend code for Freedreno, and its already modularized to support multiple backends. It currently has support for the above mentioned MSM kernel mode driver as well as virtio (a backend that allows Freedreno to be used from within in a virtualized environment). From the name of this path, you would think that the code in this module would only work with kernel mode drivers that implement DRM, but actually there is only a handful of places in this module where DRM support is assumed. This made it a good starting point to introduce the KGSL backend and piggy back off the common code.&lt;/p&gt;
+&lt;p&gt;For example the &lt;code&gt;drm&lt;/code&gt; module has a lot of code to deal with the management of synchronization primitives, buffer objects, and command submit lists. All managed at a abstraction above “DRM” and to re-implement this code would be a bad idea.&lt;/p&gt;
+&lt;h2 id="how-to-get-android-to-behave"&gt;How to get Android to behave&lt;/h2&gt;
+&lt;p&gt;One of this big struggles with getting the KGSL backend working was figuring out how I could get Android to load mesa instead of Qualcomm blob driver that is shipped with the device image. Thankfully a good chunk of this work has already been figured out when the Turnip developers (Turnip is the open source Vulkan implementation for Adreno GPUs) figured out how to get Turnip running on android with KGSL. Thankfully one of my coworkers &lt;a href="https://blogs.igalia.com/dpiliaiev/"&gt;Danylo&lt;/a&gt; is one of those Turnip developers, and he gave me a lot of guidance on getting Android setup. One thing to watch out for is the outdated instructions &lt;a href="https://docs.mesa3d.org/android.html"&gt;here&lt;/a&gt;. These instructions &lt;em&gt;almost&lt;/em&gt; work, but require some modifications. First if you’re using a more modern version of the Android NDK, the compiler has been replaced with LLVM/Clang, so you need to change which compiler is being used. Second flags like &lt;code&gt;system&lt;/code&gt; in the cross compiler script incorrectly set the system as &lt;code&gt;linux&lt;/code&gt; instead of &lt;code&gt;android&lt;/code&gt;. I had success using the below cross compiler script. Take note that the compiler paths need to be updated to match where you extracted the android NDK on your system.&lt;/p&gt;
+&lt;pre class="meson"&gt;&lt;code&gt;[binaries]
+ar = &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-ar&amp;#39;
+c = [&amp;#39;ccache&amp;#39;, &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang&amp;#39;]
+cpp = [&amp;#39;ccache&amp;#39;, &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang++&amp;#39;, &amp;#39;-fno-exceptions&amp;#39;, &amp;#39;-fno-unwind-tables&amp;#39;, &amp;#39;-fno-asynchronous-unwind-tables&amp;#39;, &amp;#39;-static-libstdc++&amp;#39;]
+c_ld = &amp;#39;lld&amp;#39;
+cpp_ld = &amp;#39;lld&amp;#39;
+strip = &amp;#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-strip&amp;#39;
+# Android doesn&amp;#39;t come with a pkg-config, but we need one for Meson to be happy not
+# finding all the optional deps it looks for. Use system pkg-config pointing at a
+# directory we get to populate with any .pc files we want to add for Android
+pkgconfig = [&amp;#39;env&amp;#39;, &amp;#39;PKG_CONFIG_LIBDIR=/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/pkgconfig:/home/lfryzek/Documents/projects/igalia/freedreno/install-android/lib/pkgconfig&amp;#39;, &amp;#39;/usr/bin/pkg-config&amp;#39;]
+
+[host_machine]
+system = &amp;#39;android&amp;#39;
+cpu_family = &amp;#39;arm&amp;#39;
+cpu = &amp;#39;armv8&amp;#39;
+endian = &amp;#39;little&amp;#39;&lt;/code&gt;&lt;/pre&gt;
+&lt;p&gt;Another thing I had to figure out with Android, that was different with these instructions, was how I would get Android to load mesa versions of mesa libraries. That’s when my colleague &lt;a href="https://www.igalia.com/team/mark"&gt;Mark&lt;/a&gt; pointed out to me that Android is open source and I could just check the source code myself. Sure enough you have find the OpenGL driver loader in &lt;a href="https://android.googlesource.com/platform/frameworks/native/+/master/opengl/libs/EGL/Loader.cpp"&gt;Android’s source code&lt;/a&gt;. From this code we can that Android will try to load a few different files based on some settings, and in my case it would try to load 3 different shaded libraries in the &lt;code&gt;/vendor/lib64/egl&lt;/code&gt; folder, &lt;code&gt;libEGL_adreno.so&lt;/code&gt; ,&lt;code&gt;libGLESv1_CM_adreno.so&lt;/code&gt;, and &lt;code&gt;libGLESv2.so&lt;/code&gt;. I could just replace these libraries with the version built from mesa and voilà, you’re now loading a custom driver! This realization that I could just “read the code” was very powerful in debugging some more android specific issues I ran into, like dealing with gralloc.&lt;/p&gt;
+&lt;p&gt;Something cool that the opensource Freedreno &amp;amp; Turnip driver developers figured out was getting android to run test OpenGL applications from the adb shell without building android APKs. If you check out the &lt;a href="https://gitlab.freedesktop.org/freedreno/freedreno"&gt;freedreno repo&lt;/a&gt;, they have an &lt;code&gt;ndk-build.sh&lt;/code&gt; script that can build tests in the &lt;code&gt;tests-*&lt;/code&gt; folder. The nice benefit of this is that it provides an easy way to run simple test cases without worrying about the android window system integration. Another nifty feature about this repo is the &lt;code&gt;libwrap&lt;/code&gt; tool that lets trace the commands being submitted to the GPU.&lt;/p&gt;
+&lt;h2 id="what-even-is-gralloc"&gt;What even is Gralloc?&lt;/h2&gt;
+&lt;p&gt;Gralloc is the graphics memory allocated in Android, and the OS will use it to allocate the surface for “windows”. This means that the memory we want to render the display to is managed by gralloc and not our KGSL backend. This means we have to get all the information about this surface from gralloc, and if you look in &lt;code&gt;src/egl/driver/dri2/platform_android.c&lt;/code&gt; you will see existing code for handing gralloc. You would think “Hey there is no work for me here then”, but you would be wrong. The handle gralloc provides is hardware specific, and the code in &lt;code&gt;platform_android.c&lt;/code&gt; assumes a DRM gralloc implementation. Thankfully the turnip developers had already gone through this struggle and if you look in &lt;code&gt;src/freedreno/vulkan/tu_android.c&lt;/code&gt; you can see they have implemented a separate path when a Qualcomm msm implementation of gralloc is detected. I could copy this detection logic and add a separate path to &lt;code&gt;platform_android.c&lt;/code&gt;.&lt;/p&gt;
+&lt;h2 id="working-with-the-freedreno-community"&gt;Working with the Freedreno community&lt;/h2&gt;
+&lt;p&gt;When working on any project (open-source or otherwise), it’s nice to know that you aren’t working alone. Thankfully the &lt;code&gt;#freedreno&lt;/code&gt; channel on &lt;code&gt;irc.oftc.net&lt;/code&gt; is very active and full of helpful people to answer any questions you may have. While working on the backend, one area I wasn’t really sure how to address was the synchronization code for buffer objects. The backend exposed a function called &lt;code&gt;cpu_prep&lt;/code&gt;, This function was just there to call the DRM implementation of &lt;code&gt;cpu_prep&lt;/code&gt; on the buffer object. I wasn’t exactly sure how to implement this functionality with KGSL since it doesn’t use DRM buffer objects.&lt;/p&gt;
+&lt;p&gt;I ended up reaching out to the IRC channel and Rob Clark on the channel explained to me that he was actually working on moving a lot of the code for &lt;code&gt;cpu_prep&lt;/code&gt; into common code so that a non-drm driver (like the KGSL backend I was working on) would just need to implement that operation as NOP (no operation).&lt;/p&gt;
+&lt;h2 id="dealing-with-bugs-reverse-engineering-the-blob"&gt;Dealing with bugs &amp;amp; reverse engineering the blob&lt;/h2&gt;
+&lt;p&gt;I encountered a few different bugs when implementing the KGSL backend, but most of them consisted of me calling KGSL wrong, or handing synchronization incorrectly. Thankfully since Turnip is already running on KGSL, I could just more carefully compare my code to what Turnip is doing and figure out my logical mistake.&lt;/p&gt;
+&lt;p&gt;Some of the bugs I encountered required the backend interface in Freedreno to be modified to expose per a new per driver implementation of that backend function, instead of just using a common implementation. For example the existing function to map a buffer object into userspace assumed that the same &lt;code&gt;fd&lt;/code&gt; for the device could be used for the buffer object in the &lt;code&gt;mmap&lt;/code&gt; call. This worked fine for any buffer objects we created through KGSL but would not work for buffer objects created from gralloc (remember the above section on surface memory for windows comming from gralloc). To resolve this issue I exposed a new per backend implementation of “map” where I could take a different path if the buffer object came from gralloc.&lt;/p&gt;
+&lt;p&gt;While testing the KGSL backend I did encounter a new bug that seems to effect both my new KGSL backend and the Turnip KGSL backend. The bug is an &lt;code&gt;iommu fault&lt;/code&gt; that occurs when the surface allocated by gralloc does not have a height that is aligned to 4. The blitting engine on a6xx GPUs copies in 16x4 chunks, so if the height is not aligned by 4 the GPU will try to write to pixels that exists outside the allocated memory. This issue only happens with KGSL backends since we import memory from gralloc, and gralloc allocates exactly enough memory for the surface, with no alignment on the height. If running on any other platform, the &lt;code&gt;fdl&lt;/code&gt; (Freedreno Layout) code would be called to compute the minimum required size for a surface which would take into account the alignment requirement for the height. The blob driver Qualcomm didn’t seem to have this problem, even though its getting the exact same buffer from gralloc. So it must be doing something different to handle the none aligned height.&lt;/p&gt;
+&lt;p&gt;Because this issue relied on gralloc, the application needed to running as an Android APK to get a surface from gralloc. The best way to fix this issue would be to figure out what the blob driver is doing and try to replicate this behavior in Freedreno (assuming it isn’t doing something silly like switch to sysmem rendering). Unfortunately it didn’t look like the libwrap library worked to trace an APK.&lt;/p&gt;
+&lt;p&gt;The libwrap library relied on a linux feature known as &lt;code&gt;LD_PRELOAD&lt;/code&gt; to load &lt;code&gt;libwrap.so&lt;/code&gt; when the application starts and replace the system functions like &lt;code&gt;open&lt;/code&gt; and &lt;code&gt;ioctl&lt;/code&gt; with their own implementation that traces what is being submitted to the KGSL kernel mode driver. Thankfully android exposes this &lt;code&gt;LD_PRELOAD&lt;/code&gt; mechanism through its “wrap” interface where you create a propety called &lt;code&gt;wrap.&amp;lt;app-name&amp;gt;&lt;/code&gt; with a value &lt;code&gt;LD_PRELOAD=&amp;lt;path to libwrap.so&amp;gt;&lt;/code&gt;. Android will then load your library like would be done in a normal linux shell. If you tried to do this with libwrap though you find very quickly that you would get corrupted traces. When android launches your APK, it doesn’t only launch your application, there are different threads for different android system related functions and some of them can also use OpenGL. The libwrap library is not designed to handle multiple threads using KGSL at the same time. After discovering this issue I created a &lt;a href="https://gitlab.freedesktop.org/freedreno/freedreno/-/merge_requests/22"&gt;MR&lt;/a&gt; that would store the tracing file handles as TLS (thread local storage) preventing the clobbering of the trace file, and also allowing you to view the traces generated by different threads separately from each other.&lt;/p&gt;
+&lt;p&gt;With this is in hand one could begin investing what the blob driver is doing to handle this unaligned surfaces.&lt;/p&gt;
+&lt;h2 id="whats-next"&gt;What’s next?&lt;/h2&gt;
+&lt;p&gt;Well the next obvious thing to fix is the aligned height issue which is still open. I’ve also worked on upstreaming my changes with this &lt;a href="https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21570"&gt;WIP MR&lt;/a&gt;.&lt;/p&gt;
+&lt;figure&gt;
+&lt;img src="/assets/freedreno/3d-mark.png" alt="Freedreno running 3d-mark" /&gt;&lt;figcaption aria-hidden="true"&gt;Freedreno running 3d-mark&lt;/figcaption&gt;
+&lt;/figure&gt;
+</description><pubDate>Tue, 28 Feb 2023 05:00:00 -0000</pubDate><guid>https://fryzekconcepts.com/notes/freedreno_journey.html</guid></item></channel></rss> \ No newline at end of file
diff --git a/html/index.html b/html/index.html
index bf37882..957740d 100644
--- a/html/index.html
+++ b/html/index.html
@@ -44,6 +44,13 @@
</a>
</h2>
<div class="notes-container">
+ <a href="/notes/freedreno_journey.html" class="note-link">
+ <div class="note-box">
+ <img src="/assets/freedreno/glinfo_freedreno_preview.png">
+ <h2>Journey Through Freedreno</h2>
+ <p>Android running FreedrenoAs part of my training at Igalia I’ve been attempting to write a new ...</p>
+ </div>
+ </a>
<a href="/notes/rasterizing-triangles.html" class="note-link">
<div class="note-box">
<img src="/assets/2022-04-03-rasterizing-triangles/Screenshot-from-2022-04-03-13-43-13.png">
diff --git a/html/notes/freedreno_journey.html b/html/notes/freedreno_journey.html
new file mode 100644
index 0000000..03a7dc4
--- /dev/null
+++ b/html/notes/freedreno_journey.html
@@ -0,0 +1,94 @@
+<!doctype html>
+
+<html class="html-note-page" lang="en">
+<head>
+ <meta charset="utf-8">
+ <meta name="viewport" content="width=device-width, initial-scale=1">
+
+ <title>Journey Through Freedreno</title>
+ <meta name="dcterms.date" content="2023-02-28" />
+
+ <link rel="stylesheet" href="/assets/style.css">
+ <link rel="icon" type="image/x-icon" href="/assets/favicon.svg">
+ <link rel="alternate" type="application/atom+xml" title="Fryzek Concepts" href="/feed.xml">
+</head>
+
+<body>
+ <div class="header-bar">
+ <a href="/index.html">
+ <img src="/assets/favicon.svg" alt="frycon logo">
+ </a>
+ <div class="header-links">
+ <a href="/now.html" class="header-link">Now</a>
+ <a href="/about.html" class="header-link">About</a>
+ </div>
+ </div>
+ <main>
+<div class="page-title-header-container">
+ <h1 class="page-title-header">Journey Through Freedreno</h1>
+ <div class="page-info-container">
+ <div class="plant-status">
+ <img src="/assets/budding.svg">
+ <div class="plant-status-text">
+ <p>budding</p>
+ </div>
+ </div>
+ <div class="page-info-date-container">
+ <p class="page-info-date">Published: 2023-02-28</p>
+ <p class="page-info-date">Last Edited: 2023-02-28</p>
+ </div>
+ </div>
+ </div>
+<div class="note-divider"></div>
+<div class="main-container">
+ <div class="note-body">
+<figure>
+<img src="/assets/freedreno/glinfo_freedreno.png" alt="Android running Freedreno" /><figcaption aria-hidden="true">Android running Freedreno</figcaption>
+</figure>
+<p>As part of my training at Igalia I’ve been attempting to write a new backend for Freedreno that targets the proprietary “KGSL” kernel mode driver. For those unaware there are two “main” kernel mode drivers on Qualcomm SOCs for the GPU, there is the “MSM”, and “KGSL”. “MSM” is DRM compliant, and Freedreno already able to run on this driver. “KGSL” is the proprietary KMD that Qualcomm’s proprietary userspace driver targets. Now why would you want to run freedreno against KGSL, when MSM exists? Well there are a few ones, first MSM only really works on an up-streamed kernel, so if you have to run a down-streamed kernel you can continue using the version of KGSL that the manufacturer shipped with your device. Second this allows you to run both the proprietary adreno driver and the open source freedreno driver on the same device just by swapping libraries, which can be very nice for quickly testing something against both drivers.</p>
+<h2 id="when-drm-isnt-just-drm">When “DRM” isn’t just “DRM”</h2>
+<p>When working on a new backend, one of the critical things to do is to make use of as much “common code” as possible. This has a number of benefits, least of all reducing the amount of code you have to write. It also allows reduces the number of bugs that will likely exist as you are relying on well tested code, and it ensures that the backend is mostly likely going to continue to work with new driver updates.</p>
+<p>When I started the work for a new backend I looked inside mesa’s <code>src/freedreno/drm</code> folder. This has the current backend code for Freedreno, and its already modularized to support multiple backends. It currently has support for the above mentioned MSM kernel mode driver as well as virtio (a backend that allows Freedreno to be used from within in a virtualized environment). From the name of this path, you would think that the code in this module would only work with kernel mode drivers that implement DRM, but actually there is only a handful of places in this module where DRM support is assumed. This made it a good starting point to introduce the KGSL backend and piggy back off the common code.</p>
+<p>For example the <code>drm</code> module has a lot of code to deal with the management of synchronization primitives, buffer objects, and command submit lists. All managed at a abstraction above “DRM” and to re-implement this code would be a bad idea.</p>
+<h2 id="how-to-get-android-to-behave">How to get Android to behave</h2>
+<p>One of this big struggles with getting the KGSL backend working was figuring out how I could get Android to load mesa instead of Qualcomm blob driver that is shipped with the device image. Thankfully a good chunk of this work has already been figured out when the Turnip developers (Turnip is the open source Vulkan implementation for Adreno GPUs) figured out how to get Turnip running on android with KGSL. Thankfully one of my coworkers <a href="https://blogs.igalia.com/dpiliaiev/">Danylo</a> is one of those Turnip developers, and he gave me a lot of guidance on getting Android setup. One thing to watch out for is the outdated instructions <a href="https://docs.mesa3d.org/android.html">here</a>. These instructions <em>almost</em> work, but require some modifications. First if you’re using a more modern version of the Android NDK, the compiler has been replaced with LLVM/Clang, so you need to change which compiler is being used. Second flags like <code>system</code> in the cross compiler script incorrectly set the system as <code>linux</code> instead of <code>android</code>. I had success using the below cross compiler script. Take note that the compiler paths need to be updated to match where you extracted the android NDK on your system.</p>
+<pre class="meson"><code>[binaries]
+ar = &#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-ar&#39;
+c = [&#39;ccache&#39;, &#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang&#39;]
+cpp = [&#39;ccache&#39;, &#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang++&#39;, &#39;-fno-exceptions&#39;, &#39;-fno-unwind-tables&#39;, &#39;-fno-asynchronous-unwind-tables&#39;, &#39;-static-libstdc++&#39;]
+c_ld = &#39;lld&#39;
+cpp_ld = &#39;lld&#39;
+strip = &#39;/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-strip&#39;
+# Android doesn&#39;t come with a pkg-config, but we need one for Meson to be happy not
+# finding all the optional deps it looks for. Use system pkg-config pointing at a
+# directory we get to populate with any .pc files we want to add for Android
+pkgconfig = [&#39;env&#39;, &#39;PKG_CONFIG_LIBDIR=/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/pkgconfig:/home/lfryzek/Documents/projects/igalia/freedreno/install-android/lib/pkgconfig&#39;, &#39;/usr/bin/pkg-config&#39;]
+
+[host_machine]
+system = &#39;android&#39;
+cpu_family = &#39;arm&#39;
+cpu = &#39;armv8&#39;
+endian = &#39;little&#39;</code></pre>
+<p>Another thing I had to figure out with Android, that was different with these instructions, was how I would get Android to load mesa versions of mesa libraries. That’s when my colleague <a href="https://www.igalia.com/team/mark">Mark</a> pointed out to me that Android is open source and I could just check the source code myself. Sure enough you have find the OpenGL driver loader in <a href="https://android.googlesource.com/platform/frameworks/native/+/master/opengl/libs/EGL/Loader.cpp">Android’s source code</a>. From this code we can that Android will try to load a few different files based on some settings, and in my case it would try to load 3 different shaded libraries in the <code>/vendor/lib64/egl</code> folder, <code>libEGL_adreno.so</code> ,<code>libGLESv1_CM_adreno.so</code>, and <code>libGLESv2.so</code>. I could just replace these libraries with the version built from mesa and voilà, you’re now loading a custom driver! This realization that I could just “read the code” was very powerful in debugging some more android specific issues I ran into, like dealing with gralloc.</p>
+<p>Something cool that the opensource Freedreno &amp; Turnip driver developers figured out was getting android to run test OpenGL applications from the adb shell without building android APKs. If you check out the <a href="https://gitlab.freedesktop.org/freedreno/freedreno">freedreno repo</a>, they have an <code>ndk-build.sh</code> script that can build tests in the <code>tests-*</code> folder. The nice benefit of this is that it provides an easy way to run simple test cases without worrying about the android window system integration. Another nifty feature about this repo is the <code>libwrap</code> tool that lets trace the commands being submitted to the GPU.</p>
+<h2 id="what-even-is-gralloc">What even is Gralloc?</h2>
+<p>Gralloc is the graphics memory allocated in Android, and the OS will use it to allocate the surface for “windows”. This means that the memory we want to render the display to is managed by gralloc and not our KGSL backend. This means we have to get all the information about this surface from gralloc, and if you look in <code>src/egl/driver/dri2/platform_android.c</code> you will see existing code for handing gralloc. You would think “Hey there is no work for me here then”, but you would be wrong. The handle gralloc provides is hardware specific, and the code in <code>platform_android.c</code> assumes a DRM gralloc implementation. Thankfully the turnip developers had already gone through this struggle and if you look in <code>src/freedreno/vulkan/tu_android.c</code> you can see they have implemented a separate path when a Qualcomm msm implementation of gralloc is detected. I could copy this detection logic and add a separate path to <code>platform_android.c</code>.</p>
+<h2 id="working-with-the-freedreno-community">Working with the Freedreno community</h2>
+<p>When working on any project (open-source or otherwise), it’s nice to know that you aren’t working alone. Thankfully the <code>#freedreno</code> channel on <code>irc.oftc.net</code> is very active and full of helpful people to answer any questions you may have. While working on the backend, one area I wasn’t really sure how to address was the synchronization code for buffer objects. The backend exposed a function called <code>cpu_prep</code>, This function was just there to call the DRM implementation of <code>cpu_prep</code> on the buffer object. I wasn’t exactly sure how to implement this functionality with KGSL since it doesn’t use DRM buffer objects.</p>
+<p>I ended up reaching out to the IRC channel and Rob Clark on the channel explained to me that he was actually working on moving a lot of the code for <code>cpu_prep</code> into common code so that a non-drm driver (like the KGSL backend I was working on) would just need to implement that operation as NOP (no operation).</p>
+<h2 id="dealing-with-bugs-reverse-engineering-the-blob">Dealing with bugs &amp; reverse engineering the blob</h2>
+<p>I encountered a few different bugs when implementing the KGSL backend, but most of them consisted of me calling KGSL wrong, or handing synchronization incorrectly. Thankfully since Turnip is already running on KGSL, I could just more carefully compare my code to what Turnip is doing and figure out my logical mistake.</p>
+<p>Some of the bugs I encountered required the backend interface in Freedreno to be modified to expose per a new per driver implementation of that backend function, instead of just using a common implementation. For example the existing function to map a buffer object into userspace assumed that the same <code>fd</code> for the device could be used for the buffer object in the <code>mmap</code> call. This worked fine for any buffer objects we created through KGSL but would not work for buffer objects created from gralloc (remember the above section on surface memory for windows comming from gralloc). To resolve this issue I exposed a new per backend implementation of “map” where I could take a different path if the buffer object came from gralloc.</p>
+<p>While testing the KGSL backend I did encounter a new bug that seems to effect both my new KGSL backend and the Turnip KGSL backend. The bug is an <code>iommu fault</code> that occurs when the surface allocated by gralloc does not have a height that is aligned to 4. The blitting engine on a6xx GPUs copies in 16x4 chunks, so if the height is not aligned by 4 the GPU will try to write to pixels that exists outside the allocated memory. This issue only happens with KGSL backends since we import memory from gralloc, and gralloc allocates exactly enough memory for the surface, with no alignment on the height. If running on any other platform, the <code>fdl</code> (Freedreno Layout) code would be called to compute the minimum required size for a surface which would take into account the alignment requirement for the height. The blob driver Qualcomm didn’t seem to have this problem, even though its getting the exact same buffer from gralloc. So it must be doing something different to handle the none aligned height.</p>
+<p>Because this issue relied on gralloc, the application needed to running as an Android APK to get a surface from gralloc. The best way to fix this issue would be to figure out what the blob driver is doing and try to replicate this behavior in Freedreno (assuming it isn’t doing something silly like switch to sysmem rendering). Unfortunately it didn’t look like the libwrap library worked to trace an APK.</p>
+<p>The libwrap library relied on a linux feature known as <code>LD_PRELOAD</code> to load <code>libwrap.so</code> when the application starts and replace the system functions like <code>open</code> and <code>ioctl</code> with their own implementation that traces what is being submitted to the KGSL kernel mode driver. Thankfully android exposes this <code>LD_PRELOAD</code> mechanism through its “wrap” interface where you create a propety called <code>wrap.&lt;app-name&gt;</code> with a value <code>LD_PRELOAD=&lt;path to libwrap.so&gt;</code>. Android will then load your library like would be done in a normal linux shell. If you tried to do this with libwrap though you find very quickly that you would get corrupted traces. When android launches your APK, it doesn’t only launch your application, there are different threads for different android system related functions and some of them can also use OpenGL. The libwrap library is not designed to handle multiple threads using KGSL at the same time. After discovering this issue I created a <a href="https://gitlab.freedesktop.org/freedreno/freedreno/-/merge_requests/22">MR</a> that would store the tracing file handles as TLS (thread local storage) preventing the clobbering of the trace file, and also allowing you to view the traces generated by different threads separately from each other.</p>
+<p>With this is in hand one could begin investing what the blob driver is doing to handle this unaligned surfaces.</p>
+<h2 id="whats-next">What’s next?</h2>
+<p>Well the next obvious thing to fix is the aligned height issue which is still open. I’ve also worked on upstreaming my changes with this <a href="https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21570">WIP MR</a>.</p>
+<figure>
+<img src="/assets/freedreno/3d-mark.png" alt="Freedreno running 3d-mark" /><figcaption aria-hidden="true">Freedreno running 3d-mark</figcaption>
+</figure>
+ </div>
+</div> </main>
+</body>
+</html>
diff --git a/notes/freedreno_journey.md b/notes/freedreno_journey.md
new file mode 100644
index 0000000..e669421
--- /dev/null
+++ b/notes/freedreno_journey.md
@@ -0,0 +1,72 @@
+---
+layout: post
+title: "Journey Through Freedreno"
+date: "2023-02-28"
+last_edit: "2023-02-28"
+status: 2
+cover_image: "/assets/freedreno/glinfo_freedreno_preview.png"
+categories: igalia graphics
+---
+![Android running Freedreno](/assets/freedreno/glinfo_freedreno.png)
+
+As part of my training at Igalia I've been attempting to write a new backend for Freedreno that targets the proprietary "KGSL" kernel mode driver. For those unaware there are two "main" kernel mode drivers on Qualcomm SOCs for the GPU, there is the "MSM", and "KGSL". "MSM" is DRM compliant, and Freedreno already able to run on this driver. "KGSL" is the proprietary KMD that Qualcomm's proprietary userspace driver targets. Now why would you want to run freedreno against KGSL, when MSM exists? Well there are a few ones, first MSM only really works on an up-streamed kernel, so if you have to run a down-streamed kernel you can continue using the version of KGSL that the manufacturer shipped with your device. Second this allows you to run both the proprietary adreno driver and the open source freedreno driver on the same device just by swapping libraries, which can be very nice for quickly testing something against both drivers.
+
+## When "DRM" isn't just "DRM"
+When working on a new backend, one of the critical things to do is to make use of as much "common code" as possible. This has a number of benefits, least of all reducing the amount of code you have to write. It also allows reduces the number of bugs that will likely exist as you are relying on well tested code, and it ensures that the backend is mostly likely going to continue to work with new driver updates.
+
+When I started the work for a new backend I looked inside mesa's `src/freedreno/drm` folder. This has the current backend code for Freedreno, and its already modularized to support multiple backends. It currently has support for the above mentioned MSM kernel mode driver as well as virtio (a backend that allows Freedreno to be used from within in a virtualized environment). From the name of this path, you would think that the code in this module would only work with kernel mode drivers that implement DRM, but actually there is only a handful of places in this module where DRM support is assumed. This made it a good starting point to introduce the KGSL backend and piggy back off the common code.
+
+For example the `drm` module has a lot of code to deal with the management of synchronization primitives, buffer objects, and command submit lists. All managed at a abstraction above "DRM" and to re-implement this code would be a bad idea.
+
+## How to get Android to behave
+One of this big struggles with getting the KGSL backend working was figuring out how I could get Android to load mesa instead of Qualcomm blob driver that is shipped with the device image. Thankfully a good chunk of this work has already been figured out when the Turnip developers (Turnip is the open source Vulkan implementation for Adreno GPUs) figured out how to get Turnip running on android with KGSL. Thankfully one of my coworkers [Danylo](https://blogs.igalia.com/dpiliaiev/) is one of those Turnip developers, and he gave me a lot of guidance on getting Android setup. One thing to watch out for is the outdated instructions [here](https://docs.mesa3d.org/android.html). These instructions *almost* work, but require some modifications. First if you're using a more modern version of the Android NDK, the compiler has been replaced with LLVM/Clang, so you need to change which compiler is being used. Second flags like `system` in the cross compiler script incorrectly set the system as `linux` instead of `android`. I had success using the below cross compiler script. Take note that the compiler paths need to be updated to match where you extracted the android NDK on your system.
+
+```meson
+[binaries]
+ar = '/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-ar'
+c = ['ccache', '/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang']
+cpp = ['ccache', '/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/aarch64-linux-android29-clang++', '-fno-exceptions', '-fno-unwind-tables', '-fno-asynchronous-unwind-tables', '-static-libstdc++']
+c_ld = 'lld'
+cpp_ld = 'lld'
+strip = '/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/toolchains/llvm/prebuilt/linux-x86_64/bin/llvm-strip'
+# Android doesn't come with a pkg-config, but we need one for Meson to be happy not
+# finding all the optional deps it looks for. Use system pkg-config pointing at a
+# directory we get to populate with any .pc files we want to add for Android
+pkgconfig = ['env', 'PKG_CONFIG_LIBDIR=/home/lfryzek/Documents/projects/igalia/freedreno/android-ndk-r25b-linux/android-ndk-r25b/pkgconfig:/home/lfryzek/Documents/projects/igalia/freedreno/install-android/lib/pkgconfig', '/usr/bin/pkg-config']
+
+[host_machine]
+system = 'android'
+cpu_family = 'arm'
+cpu = 'armv8'
+endian = 'little'
+```
+
+Another thing I had to figure out with Android, that was different with these instructions, was how I would get Android to load mesa versions of mesa libraries. That's when my colleague [Mark](https://www.igalia.com/team/mark) pointed out to me that Android is open source and I could just check the source code myself. Sure enough you have find the OpenGL driver loader in [Android's source code](https://android.googlesource.com/platform/frameworks/native/+/master/opengl/libs/EGL/Loader.cpp). From this code we can that Android will try to load a few different files based on some settings, and in my case it would try to load 3 different shaded libraries in the `/vendor/lib64/egl` folder, `libEGL_adreno.so` ,`libGLESv1_CM_adreno.so`, and `libGLESv2.so`. I could just replace these libraries with the version built from mesa and voilà, you're now loading a custom driver! This realization that I could just "read the code" was very powerful in debugging some more android specific issues I ran into, like dealing with gralloc.
+
+Something cool that the opensource Freedreno & Turnip driver developers figured out was getting android to run test OpenGL applications from the adb shell without building android APKs. If you check out the [freedreno repo](https://gitlab.freedesktop.org/freedreno/freedreno), they have an `ndk-build.sh` script that can build tests in the `tests-*` folder. The nice benefit of this is that it provides an easy way to run simple test cases without worrying about the android window system integration. Another nifty feature about this repo is the `libwrap` tool that lets trace the commands being submitted to the GPU.
+
+## What even is Gralloc?
+Gralloc is the graphics memory allocated in Android, and the OS will use it to allocate the surface for "windows". This means that the memory we want to render the display to is managed by gralloc and not our KGSL backend. This means we have to get all the information about this surface from gralloc, and if you look in `src/egl/driver/dri2/platform_android.c` you will see existing code for handing gralloc. You would think "Hey there is no work for me here then", but you would be wrong. The handle gralloc provides is hardware specific, and the code in `platform_android.c` assumes a DRM gralloc implementation. Thankfully the turnip developers had already gone through this struggle and if you look in `src/freedreno/vulkan/tu_android.c` you can see they have implemented a separate path when a Qualcomm msm implementation of gralloc is detected. I could copy this detection logic and add a separate path to `platform_android.c`.
+
+## Working with the Freedreno community
+When working on any project (open-source or otherwise), it's nice to know that you aren't working alone. Thankfully the `#freedreno` channel on `irc.oftc.net` is very active and full of helpful people to answer any questions you may have. While working on the backend, one area I wasn't really sure how to address was the synchronization code for buffer objects. The backend exposed a function called `cpu_prep`, This function was just there to call the DRM implementation of `cpu_prep` on the buffer object. I wasn't exactly sure how to implement this functionality with KGSL since it doesn't use DRM buffer objects.
+
+I ended up reaching out to the IRC channel and Rob Clark on the channel explained to me that he was actually working on moving a lot of the code for `cpu_prep` into common code so that a non-drm driver (like the KGSL backend I was working on) would just need to implement that operation as NOP (no operation).
+
+## Dealing with bugs & reverse engineering the blob
+I encountered a few different bugs when implementing the KGSL backend, but most of them consisted of me calling KGSL wrong, or handing synchronization incorrectly. Thankfully since Turnip is already running on KGSL, I could just more carefully compare my code to what Turnip is doing and figure out my logical mistake.
+
+Some of the bugs I encountered required the backend interface in Freedreno to be modified to expose per a new per driver implementation of that backend function, instead of just using a common implementation. For example the existing function to map a buffer object into userspace assumed that the same `fd` for the device could be used for the buffer object in the `mmap` call. This worked fine for any buffer objects we created through KGSL but would not work for buffer objects created from gralloc (remember the above section on surface memory for windows comming from gralloc). To resolve this issue I exposed a new per backend implementation of "map" where I could take a different path if the buffer object came from gralloc.
+
+While testing the KGSL backend I did encounter a new bug that seems to effect both my new KGSL backend and the Turnip KGSL backend. The bug is an `iommu fault` that occurs when the surface allocated by gralloc does not have a height that is aligned to 4. The blitting engine on a6xx GPUs copies in 16x4 chunks, so if the height is not aligned by 4 the GPU will try to write to pixels that exists outside the allocated memory. This issue only happens with KGSL backends since we import memory from gralloc, and gralloc allocates exactly enough memory for the surface, with no alignment on the height. If running on any other platform, the `fdl` (Freedreno Layout) code would be called to compute the minimum required size for a surface which would take into account the alignment requirement for the height. The blob driver Qualcomm didn't seem to have this problem, even though its getting the exact same buffer from gralloc. So it must be doing something different to handle the none aligned height.
+
+Because this issue relied on gralloc, the application needed to running as an Android APK to get a surface from gralloc. The best way to fix this issue would be to figure out what the blob driver is doing and try to replicate this behavior in Freedreno (assuming it isn't doing something silly like switch to sysmem rendering). Unfortunately it didn't look like the libwrap library worked to trace an APK.
+
+The libwrap library relied on a linux feature known as `LD_PRELOAD` to load `libwrap.so` when the application starts and replace the system functions like `open` and `ioctl` with their own implementation that traces what is being submitted to the KGSL kernel mode driver. Thankfully android exposes this `LD_PRELOAD` mechanism through its "wrap" interface where you create a propety called `wrap.<app-name>` with a value `LD_PRELOAD=<path to libwrap.so>`. Android will then load your library like would be done in a normal linux shell. If you tried to do this with libwrap though you find very quickly that you would get corrupted traces. When android launches your APK, it doesn't only launch your application, there are different threads for different android system related functions and some of them can also use OpenGL. The libwrap library is not designed to handle multiple threads using KGSL at the same time. After discovering this issue I created a [MR](https://gitlab.freedesktop.org/freedreno/freedreno/-/merge_requests/22) that would store the tracing file handles as TLS (thread local storage) preventing the clobbering of the trace file, and also allowing you to view the traces generated by different threads separately from each other.
+
+With this is in hand one could begin investing what the blob driver is doing to handle this unaligned surfaces.
+
+## What's next?
+Well the next obvious thing to fix is the aligned height issue which is still open. I've also worked on upstreaming my changes with this [WIP MR](https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21570).
+
+![Freedreno running 3d-mark](/assets/freedreno/3d-mark.png)