https://multimedia.cx/eggs/understanding-the-dreamcast-gd-rom-layout/

Breaking Eggs And Making Omelettes
Topics On Multimedia Technology and Reverse Engineering

Understanding The Dreamcast GD-ROM Layout

March 23rd, 2022 by Multimedia Mike

I'm finally completing something I set out to comprehend over a decade ago.
I wanted to understand how data is actually laid out on a Sega Dreamcast GD-ROM disc. I'm trying to remember why I even still care. There was something about how I wanted to make sure the contents of a set of Dreamcast demo discs were archived for study.

[Image: Lot of 9 volumes of the Official Sega Dreamcast Magazine]

I eventually figured it out. Read on if you are interested in the technical details. Or, if you would like to examine the fruits of this effort, check out the Dreamcast demo discs that I took apart and uploaded to the Internet Archive.

Motivation

Why do I still care about this? Well, see the original charter of this blog above. It's mostly about studying multimedia formats, as well as the general operation of games and their non-multimedia data formats. It's also something that has nagged at me ever since I extracted a bunch of Dreamcast discs years ago and tried to understand why the tracks were arranged the way they were, and how I could systematically split the files out of the filesystem.

This turns out not to be as easy as it might sound, even if you can get past the obstacle of getting at the raw data.

CD/CD-ROM Refresher

As I laid out in my Grand Unified Theory of Compact Disc, every compact disc can be viewed conceptually as a string of sectors, where each sector is 2352 bytes long. The difference among the various CD types (audio CDs, various CD-ROM types) boils down to the format of the contents of the 2352-byte sectors. For an audio CD, every sector's 2352 bytes represent 1/75 of a second of CD-quality audio samples.

Meanwhile, there are various sector layouts for different CD-ROM modes, useful for storing computer data. This post is most interested in "mode 1/form 1", which uses 2048 of the 2352 bytes for data, while using the remaining bytes for error detection and correction codes. A filesystem (usually ISO-9660) is overlaid on these 2048-byte sectors in order to create data structures for organizing strings of sectors into files.
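
The sector arithmetic above is easy to check with a few lines of Python. This is only a rough sketch: the function names are made up for illustration, and the 74-minute disc length is an assumed round figure rather than anything measured for this post.

```python
RAW_SECTOR_BYTES = 2352      # size of every CD sector, whatever its type
DATA_SECTOR_BYTES = 2048     # user data in a data sector; the rest is
                             # sync, header, and error detection/correction
SECTORS_PER_SECOND = 75      # one sector holds 1/75 of a second of audio


def audio_sector_bytes(sample_rate=44100, channels=2, bytes_per_sample=2):
    """Bytes of CD-quality PCM audio that fit in 1/75 of a second."""
    return sample_rate * channels * bytes_per_sample // SECTORS_PER_SECOND


def audio_seconds(sector_count):
    """Playing time, in seconds, of a run of audio sectors."""
    return sector_count / SECTORS_PER_SECOND


def data_capacity_bytes(sector_count):
    """User data held by a run of 2048-byte data sectors."""
    return sector_count * DATA_SECTOR_BYTES


if __name__ == "__main__":
    # Assumption: a nominal 74-minute disc.
    sectors = 74 * 60 * SECTORS_PER_SECOND
    print("audio bytes per sector:", audio_sector_bytes())
    print("sectors on the disc:", sectors)
    print("data capacity in MiB:", data_capacity_bytes(sectors) / 2**20)
```

Stereo 16-bit audio at 44.1 kHz works out to exactly 2352 bytes per 1/75 of a second, which is where the odd-looking sector size comes from.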
A CD has between 1 and 99 tracks. A pure CD-ROM will have a single data track. Pure audio CDs tend to have numerous audio tracks, usually 1 per song. Mixed CDs are common. For software, this usually manifests as the first track being data and containing an ISO-9660 filesystem, followed by a series of audio tracks, sometimes for in-game music. For audio CDs, there is occasionally a data track at the end of the disc with some extra media types.

GD-ROM Refresher

The Dreamcast used optical discs called GD-ROMs, where the GD stands for "gigadisc". These discs were designed to hold about 1 gigabyte of data, vs. the usual 650-700MB offered by standard CD solutions, while using the same laser unit as is used for CDs. I'm not sure exactly how this was achieved. I always assumed it was some sort of "double density" sector scheme. According to Wikipedia, the drive reads the disc at a slower rate, which allows it to read more data (presumably the "pits" vs. "lands" which comprise the surface of an optical disc). This might be equivalent to my theory.

GD-ROM discs cannot be read in a standard optical drive. It is necessary to get custom software onto the Dreamcast which will ask the optical hardware to extract the sectors and exfiltrate them off of the unit somehow. There are numerous methods for this. Alternatively, just find rips, which are increasingly plentiful around the internet. However, just because you might be able to find the data for a given disc does not mean that you can easily explore the contents.

Typical Layout Patterns

Going back to my study of the GD-ROM track layouts, 2 clear patterns emerge:

1. All of the game data is packed into track 3:

   [Image: GD-ROM Layout Type 1]

2. Track 3 has data, the last track has data, and the tracks in between contain standard CD audio:

   [Image: GD-ROM Layout Type 2]

Also, the disc is always, always 100% utilized.

Track 1 always contains an ISO-9660 filesystem and can be read by any standard CD-ROM drive. And it usually has nothing interesting.
Track 3 also contains what appears to be an ISO-9660 filesystem. However, if you have a rip of the track and try to mount the image with standard tools, it will not work. In the second layout, the data in the last track follows no obvious format.

Cracking The Filesystem Code

I figured out quite a few years ago that the consolidated data track 3 is simply a standard ISO-9660 filesystem that would work fine with standard ISO-9660 reading software... if the data track were located beginning at sector 45000. The filesystem data structures contain references to absolute sector numbers. Thus, if it were possible to modify some ISO-9660 software to assume the first sector is 45000, it ought to have no trouble interpreting the data.

[Image: ISO-9660 In A Single Track]

How about the split data track format? Actually, it works the same way. If all the data were sitting on its original disc, track 3 would have data structures pointing to strings of contiguous sectors (extents) in the final track, and those are the files. To express it more succinctly: track 3 contains the filesystem root structure and the directory structures, while the final track contains the actual file data. How is the filesystem always 100% full? Track 3 gets padded out with 0-sectors until the beginning of any audio sectors.

[Image: ISO-9660 Spread Across 2 Tracks]

Why Lay Things Out Like This?

Why push the data as far out on the disc as possible? A reasonable explanation for this would be read performance. Compact discs were designed around Constant Linear Velocity (CLV), but faster drives tend to operate at or near Constant Angular Velocity (CAV). The implication of CAV is that data on the outside of the disc is read faster than data on the inside. I once profiled this characteristic in order to prove it to myself, using both PC CD drives as well as a Dreamcast.
By pushing the data to the outer sectors, graphical data gets loaded into RAM faster, and full motion videos, which require a certain minimum bitrate for a good experience, have a better guarantee that playback will be smooth.

Implications For Repacking

Once people figured out how to boot burned CDs in the Dreamcast, they had a new problem: squeeze as much as 1 gigabyte down to around 650 megabytes at the most. It looks like the most straightforward strategy was to simply rework the filesystem to remove the often enormous amount of empty space in track 3.

My understanding is that another major strategy is to re-encode certain large assets. Full motion video (FMV) assets are a good target here since the prevailing FMV middleware format used on Sega Dreamcast games was Sofdec, which is basically just MPEG-1 video. There is ample opportunity to transcode these files to lower bitrate settings to squeeze some bits (and a lot of visual quality) out of them.

Further, if you don't really care about the audio tracks, you could just replace them with brief spurts of silence.

Making A Tool

So I could make a tool that would process these collections of files representing a disc. I could also adapt it for the various forms that a Dreamcast rip might take (I have found at least 3 so far). I could eventually expand it to handle lots of other disc formats (you know, something like what Aaru does these days).

And that would have been my modus operandi perhaps 10 or more years ago. And of course, the ambitious tool would never have seen daylight as I got distracted by other ideas.

I wanted to get a solution up and running as quickly as possible this time. Here was my initial brainstorm: assemble all the tracks into a single, large disc image while pretending the audio tracks consist of 2048-byte sectors. In doing so, I ought to be able to use fuseiso to mount the giant image, with a modification to look for the starting sector at a somewhat nonstandard location.
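
That brainstorm can be sketched in a few lines of Python. To be clear, this is not the script used for the archive. It is a rough illustration under some assumptions: the track list is a hypothetical list of (start sector, is-data flag, bytes per sector, path) tuples covering track 3 onward, data track files are whole multiples of 2048 bytes, and each track is padded out to its absolute starting sector rather than simply appended. The helper names are invented. The only ISO-9660 fact relied on is that volume descriptors begin 16 sectors into a volume and carry the identifier "CD001".

```python
import os
import shutil

SECTOR = 2048            # bytes per sector in the combined image
HD_AREA_START = 45000    # absolute sector where track 3 begins (per the post)


def write_zero_sectors(out, count):
    """Append `count` all-zero 2048-byte sectors to the open file `out`."""
    zero = bytes(SECTOR)
    for _ in range(count):
        out.write(zero)


def build_image(tracks, out_path):
    """Assemble one flat image and return its length in sectors.

    `tracks` is a hypothetical list of
    (start_sector, is_data, bytes_per_sector, path) tuples.
    """
    written = 0  # sectors written so far
    with open(out_path, "wb") as out:
        for start, is_data, bytes_per_sector, path in tracks:
            # Pad with zeros so the track lands on its absolute sector number.
            if start > written:
                write_zero_sectors(out, start - written)
            if is_data:
                with open(path, "rb") as src:
                    shutil.copyfileobj(src, out)
            else:
                # Audio: keep the sector count, drop the samples.
                count = os.path.getsize(path) // bytes_per_sector
                write_zero_sectors(out, count)
            written = out.tell() // SECTOR
    return written


def has_iso9660_at(image_path, base_sector):
    """True if an ISO-9660 volume descriptor sits 16 sectors past base_sector."""
    with open(image_path, "rb") as f:
        f.seek((base_sector + 16) * SECTOR)
        descriptor = f.read(6)
    return descriptor[1:6] == b"CD001"
```

With a real rip, something like `has_iso9660_at("disc.bin", HD_AREA_START)` would be a quick sanity check that the filesystem ended up where a modified mounting tool expects it; `disc.bin` is a placeholder name.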
To achieve the first part, I wrote a quick Python script that processed the contents of a GDI file, which was stored alongside the ISO (data) and RAW (audio) track rips from when I extracted the disc. The GDI is a very matter-of-fact listing of the tracks and their properties, e.g.:

    5
    1 0 4 2048 track01.iso 0
    2 721 0 2352 track02.raw 0
    3 45000 4 2048 track03.iso 0
    4 338449 0 2352 track04.raw 0
    5 349096 4 2048 track05.iso 0

The first line is the number of tracks. The fields on each following line are:

    track number / starting sector / track type (4=data, 0=audio) / bytes per sector / filename / ??

The script skips the first 2 filenames, instead writing 45000 zero sectors in order to simulate the CD-compatible area. Then, for each file, if it's an ISO, append the data to the final data file; if it's audio, compute the number of sectors occupied, and then append that number of 2048-byte zero sectors to the final data file.

Finally, to interpret the filesystem, I used an old tool that I've relied upon for a long time: fuseiso. This is a program that leverages Filesystem in Userspace (FUSE) to mount ISO-9660 filesystems as part of the local filesystem, without needing root privileges. The original source hasn't been updated for 15 years, but I found a repo that attempts to modernize it slightly. I forked a version which fixes a few build issues. Anyway, I just had to update a table to ask it to start looking for the root ISO-9660 filesystem at a different location than normal.

Suddenly, after so many years, I was able to freely browse a GD-ROM filesystem directly under Linux!

Conclusion And Next Steps

I had to hack the fuseiso3 tool a bit in order to make this work. I don't think it's especially valuable to make sure anyone can run with the same modifications, since the tool assumes that a GD-ROM rip has been processed through the exact pipeline I described above.

I have uploaded all of the North American Dreamcast demo discs to archive.org. See this post for a more granular breakdown of what this entails.
In the course of this exercise, I also found some European demo discs that could use the same extraction.

What else? Should I perform the same extraction experiment for all known Dreamcast games? Would anyone care? Maybe, if there's a demand for it.

I plan to do a followup on the interesting and weird things I have found on these discs so far.

Posted in Sega Dreamcast