{"id":1341,"date":"2023-07-10T00:02:01","date_gmt":"2023-07-09T23:02:01","guid":{"rendered":"https:\/\/exponentialdecay.co.uk\/blog\/?p=1341"},"modified":"2025-11-28T08:20:10","modified_gmt":"2025-11-28T08:20:10","slug":"published-fractal-in-detail-what-information-is-in-a-file-format-identification-report","status":"publish","type":"post","link":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/","title":{"rendered":"What information is in a file format identification report?"},"content":{"rendered":"<p>In early 2022, I was finally able to get around to writing a paper that I had been thinking about for the better part of a decade. The paper, &#8220;Fractal in Detail: What Information Is in a File Format Identification Report?&#8221; was published in the <a href=\"https:\/\/journal.code4lib.org\/articles\/16351\" target=\"_blank\" rel=\"noopener\">Code4Lib journal Issue 53<\/a>.<\/p>\n<p>The paper takes a deep dive into the fractal contents of file format identification reports exported from tools like <a href=\"https:\/\/github.com\/richardlehane\/siegfried\" target=\"_blank\" rel=\"noopener\">Siegfried<\/a> and <a href=\"https:\/\/github.com\/digital-preservation\/droid\" target=\"_blank\" rel=\"noopener\">DROID<\/a>.<\/p>\n<p>Let&#8217;s take a brief look the article and its contents below.<\/p>\n<p><!--more--><\/p>\n<p>Given this example from Siegfried, information in file-format reports may look something like as follows:<\/p>\n<pre>---\r\nsiegfried : 1.10.1\r\nscandate : 2023-07-09T16:53:44+02:00\r\nsignature : default.sig\r\ncreated : 2023-05-12T09:10:13Z\r\nidentifiers :\r\n\u00a0 \u00a0- name : 'pronom'\r\n\u00a0 \u00a0 \u00a0details : 'DROID_SignatureFile_V112.xml; container-signature-20230510.xml'\r\n---\r\nfilename : 'sf'\r\nfilesize : 10024103\r\nmodified : 2023-07-09T16:20:20+02:00\r\nerrors :\r\nmd5 : b08e809832955674c801559f7a9adf17\r\nmatches :\r\n\u00a0 \u00a0- ns : 'pronom'\r\n\u00a0 \u00a0 \u00a0id : 'fmt\/690'\r\n\u00a0 \u00a0 \u00a0format : 'Executable and Linkable Format'\r\n\u00a0 \u00a0 \u00a0version : '64bit Little Endian'\r\n\u00a0 \u00a0 \u00a0mime :\r\n\u00a0 \u00a0 \u00a0class :\r\n\u00a0 \u00a0 \u00a0basis : 'byte match at 0, 7'\r\n<\/pre>\n<p>Over a large enough corpus of files this information reveals so much about a collection. That information includes:<\/p>\n<ul>\n<li>Range of format identification.<\/li>\n<li>Unidentified file formats as an indicator of further work.<\/li>\n<li>Identified file formats as an indicator of unwanted files.<\/li>\n<li>Identified file formats as an indicator of the complexity of a collection.<\/li>\n<li>File and directory names (and their potential to be analysed).<\/li>\n<li>Encoding information.<\/li>\n<li>Empty directories.<\/li>\n<li>File Sizes.<\/li>\n<li>Date files were last modified.<\/li>\n<li>Information about zero-byte files.<\/li>\n<li>Checksum analysis and duplicate detection.<\/li>\n<li>Information about System files.<\/li>\n<\/ul>\n<p>With a consistent abstraction for viewing this data, one can document a collection in great detail, draw connections between other collections, and identify new work programs involved in maintaining a digital collection over the longest period of time.<\/p>\n<p>I have been working to extract this information from file format identification reports in a consistent repeatable way in my tool <a href=\"https:\/\/github.com\/exponential-decay\/demystify\" target=\"_blank\" rel=\"noopener\">Demystify<\/a>, and <a href=\"https:\/\/ross-spencer.github.io\/demystify-lite\/\" target=\"_blank\" rel=\"noopener\">Demystify-Lite<\/a> (see also <a href=\"https:\/\/exponentialdecay.co.uk\/blog\/client-side-identification-and-reporting-pipeline-with-siegfried-and-demystify-lite\/\" target=\"_blank\" rel=\"noopener\">the blog about that effort<\/a>).<\/p>\n<p>A file format identification report is an important artifact that often exists at the beginning of a digital transfer process and may then be recreated a number of times as the collection is processed. My paper goes into a lot more detail about how you might use the information in it and looks at some of the other tools out there that are already trying to do that.<\/p>\n<p>The paper received a lot of positive comments at the time of publishing.\u00a0Let me know what you think about it and if you have other ideas about how you might leverage format identification reports in your day-to-day work.<\/p>\n<p>Read more in <a href=\"https:\/\/journal.code4lib.org\/articles\/16351\" target=\"_blank\" rel=\"noopener\">Code4L`ib issue 53<\/a>.<\/p>\n<h2><em><strong>Epilogue<\/strong><\/em><\/h2>\n<p><em>Since the publication of this paper I have written a new tool that uses the checksums output in a file format identification report to create a checksum for a &#8220;folder&#8221; or directory where checksums for those do not normally exist. This tool is called <a href=\"https:\/\/github.com\/ross-spencer\/sumfolder1\" target=\"_blank\" rel=\"noopener\">Sumfolder1<\/a> and I will be introducing it in more detail in a <a href=\"https:\/\/exponentialdecay.co.uk\/blog\/signposting-what-is-the-checksum-of-a-directory-introducing-sumfolder1\/\" target=\"_blank\" rel=\"noopener\">later blog<\/a>.\u00a0<\/em><\/p>\n<div class=\"pvc_clear\"><\/div>\n<p id=\"pvc_stats_1341\" class=\"pvc_stats total_only  \" data-element-id=\"1341\" style=\"\"><i class=\"pvc-stats-icon small\" aria-hidden=\"true\"><svg aria-hidden=\"true\" focusable=\"false\" data-prefix=\"far\" data-icon=\"chart-bar\" role=\"img\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 512 512\" class=\"svg-inline--fa fa-chart-bar fa-w-16 fa-2x\"><path fill=\"currentColor\" d=\"M396.8 352h22.4c6.4 0 12.8-6.4 12.8-12.8V108.8c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v230.4c0 6.4 6.4 12.8 12.8 12.8zm-192 0h22.4c6.4 0 12.8-6.4 12.8-12.8V140.8c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v198.4c0 6.4 6.4 12.8 12.8 12.8zm96 0h22.4c6.4 0 12.8-6.4 12.8-12.8V204.8c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v134.4c0 6.4 6.4 12.8 12.8 12.8zM496 400H48V80c0-8.84-7.16-16-16-16H16C7.16 64 0 71.16 0 80v336c0 17.67 14.33 32 32 32h464c8.84 0 16-7.16 16-16v-16c0-8.84-7.16-16-16-16zm-387.2-48h22.4c6.4 0 12.8-6.4 12.8-12.8v-70.4c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v70.4c0 6.4 6.4 12.8 12.8 12.8z\" class=\"\"><\/path><\/svg><\/i> <img loading=\"lazy\" decoding=\"async\" width=\"16\" height=\"16\" alt=\"Loading\" src=\"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/plugins\/page-views-count\/ajax-loader-2x.gif\" border=0 \/><\/p>\n<div class=\"pvc_clear\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>In early 2022, I was finally able to get around to writing a paper that I had been thinking about for the better part of a decade. The paper, &#8220;Fractal in Detail: What Information Is in a File Format Identification Report?&#8221; was published in the <a href=\"https:\/\/journal.code4lib.org\/articles\/16351\" target=\"_blank\" rel=\"noopener\">Code4Lib journal Issue 53<\/a>.<\/p>\n<p>The paper takes a deep dive into the fractal contents of file format identification reports exported from tools like <a href=\"https:\/\/github.com\/richardlehane\/siegfried\" target=\"_blank\" rel=\"noopener\">Siegfried<\/a> and <a href=\"https:\/\/github.com\/digital-preservation\/droid\" target=\"_blank\" rel=\"noopener\">DROID<\/a>.<\/p>\n<p>Let&#8217;s take a brief look the article and its contents below.<\/p>\n<div class=\"link-more\"><a href=\"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &ldquo;What information is in a file format identification report?&rdquo;<\/span>&hellip;<\/a><\/div>\n<div class=\"pvc_clear\"><\/div>\n<p id=\"pvc_stats_1341\" class=\"pvc_stats total_only  \" data-element-id=\"1341\" style=\"\"><i class=\"pvc-stats-icon small\" aria-hidden=\"true\"><svg aria-hidden=\"true\" focusable=\"false\" data-prefix=\"far\" data-icon=\"chart-bar\" role=\"img\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 512 512\" class=\"svg-inline--fa fa-chart-bar fa-w-16 fa-2x\"><path fill=\"currentColor\" d=\"M396.8 352h22.4c6.4 0 12.8-6.4 12.8-12.8V108.8c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v230.4c0 6.4 6.4 12.8 12.8 12.8zm-192 0h22.4c6.4 0 12.8-6.4 12.8-12.8V140.8c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v198.4c0 6.4 6.4 12.8 12.8 12.8zm96 0h22.4c6.4 0 12.8-6.4 12.8-12.8V204.8c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v134.4c0 6.4 6.4 12.8 12.8 12.8zM496 400H48V80c0-8.84-7.16-16-16-16H16C7.16 64 0 71.16 0 80v336c0 17.67 14.33 32 32 32h464c8.84 0 16-7.16 16-16v-16c0-8.84-7.16-16-16-16zm-387.2-48h22.4c6.4 0 12.8-6.4 12.8-12.8v-70.4c0-6.4-6.4-12.8-12.8-12.8h-22.4c-6.4 0-12.8 6.4-12.8 12.8v70.4c0 6.4 6.4 12.8 12.8 12.8z\" class=\"\"><\/path><\/svg><\/i> <img loading=\"lazy\" decoding=\"async\" width=\"16\" height=\"16\" alt=\"Loading\" src=\"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/plugins\/page-views-count\/ajax-loader-2x.gif\" border=0 \/><\/p>\n<div class=\"pvc_clear\"><\/div>\n","protected":false},"author":1,"featured_media":1344,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":3,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"federated","footnotes":""},"categories":[86,114,3,168],"tags":[183,403,147,71,15,372,191,17,402,184,401,198,194,195,16,192,193,185,197,196],"class_list":["post-1341","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-archives","category-digital-literacy","category-digital-preservation","category-publications","tag-code4lib","tag-code4lib-journal","tag-digipres","tag-digital-preservation","tag-droid","tag-file-format-analysis","tag-file-format-identification","tag-file-formats","tag-filedriller","tag-format-identification","tag-freud","tag-linting","tag-metadata","tag-preservation-metadata","tag-pronom","tag-puid","tag-puids","tag-siegfried","tag-static-analysis","tag-technical-metadata","entry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>What information is in a file format identification report?<\/title>\n<meta name=\"description\" content=\"A deep dive into the fractal contents of file format identification reports and how they can be used for digital preservation.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What information is in a file format identification report?\" \/>\n<meta property=\"og:description\" content=\"A deep dive into the fractal contents of file format identification reports and how they can be used for digital preservation.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/\" \/>\n<meta property=\"og:site_name\" content=\"ross spencer :: exponentialdecay.digipres :: blog\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-09T23:02:01+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-11-28T08:20:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2023\/07\/fractal-in-detail.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1293\" \/>\n\t<meta property=\"og:image:height\" content=\"542\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Ross Spencer\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@beet_keeper\" \/>\n<meta name=\"twitter:site\" content=\"@beet_keeper\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ross Spencer\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/\"},\"author\":{\"name\":\"Ross Spencer\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/4cae0a954400f42b9c1b70c699837716\"},\"headline\":\"What information is in a file format identification report?\",\"datePublished\":\"2023-07-09T23:02:01+00:00\",\"dateModified\":\"2025-11-28T08:20:10+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/\"},\"wordCount\":434,\"commentCount\":5,\"publisher\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/4cae0a954400f42b9c1b70c699837716\"},\"image\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/fractal-in-detail.png\",\"keywords\":[\"Code4Lib\",\"Code4Lib Journal\",\"digipres\",\"Digital Preservation\",\"DROID\",\"File Format Analysis\",\"File Format Identification\",\"File Formats\",\"FileDriller\",\"format identification\",\"Freud\",\"Linting\",\"Metadata\",\"Preservation Metadata\",\"PRONOM\",\"PUID\",\"PUIDS\",\"siegfried\",\"Static Analysis\",\"Technical Metadata\"],\"articleSection\":[\"Archives\",\"Digital Literacy\",\"Digital Preservation\",\"Publications\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/\",\"url\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/\",\"name\":\"What information is in a file format identification report?\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/fractal-in-detail.png\",\"datePublished\":\"2023-07-09T23:02:01+00:00\",\"dateModified\":\"2025-11-28T08:20:10+00:00\",\"description\":\"A deep dive into the fractal contents of file format identification reports and how they can be used for digital preservation.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#primaryimage\",\"url\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/fractal-in-detail.png\",\"contentUrl\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/fractal-in-detail.png\",\"width\":1293,\"height\":542,\"caption\":\"Abstract from Fractal in Detail: What information is in a file format identification report from the Code4Lib Journal.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What information is in a file format identification report?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/\",\"name\":\"ross spencer :: exponentialdecay.digipres :: blog\",\"description\":\"Digital preservation analyst, researcher, and software developer\",\"publisher\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/4cae0a954400f42b9c1b70c699837716\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/4cae0a954400f42b9c1b70c699837716\",\"name\":\"Ross Spencer\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2025\\\/06\\\/avatar-scaled.png\",\"url\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2025\\\/06\\\/avatar-scaled.png\",\"contentUrl\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2025\\\/06\\\/avatar-scaled.png\",\"width\":2560,\"height\":2560,\"caption\":\"Ross Spencer\"},\"logo\":{\"@id\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/wp-content\\\/uploads\\\/2025\\\/06\\\/avatar-scaled.png\"},\"description\":\"Digital preservation domain expert and full-stack software developer.\",\"sameAs\":[\"http:\\\/\\\/www.exponentialdecay.co.uk\\\/blog\",\"https:\\\/\\\/www.instagram.com\\\/b33tk33p3r\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/in\\\/ross-spencer-b6b9b758\\\/\",\"https:\\\/\\\/x.com\\\/beet_keeper\"],\"url\":\"https:\\\/\\\/exponentialdecay.co.uk\\\/blog\\\/author\\\/exponentialdecay\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"What information is in a file format identification report?","description":"A deep dive into the fractal contents of file format identification reports and how they can be used for digital preservation.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/","og_locale":"en_US","og_type":"article","og_title":"What information is in a file format identification report?","og_description":"A deep dive into the fractal contents of file format identification reports and how they can be used for digital preservation.","og_url":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/","og_site_name":"ross spencer :: exponentialdecay.digipres :: blog","article_published_time":"2023-07-09T23:02:01+00:00","article_modified_time":"2025-11-28T08:20:10+00:00","og_image":[{"width":1293,"height":542,"url":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2023\/07\/fractal-in-detail.png","type":"image\/png"}],"author":"Ross Spencer","twitter_card":"summary_large_image","twitter_creator":"@beet_keeper","twitter_site":"@beet_keeper","twitter_misc":{"Written by":"Ross Spencer","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#article","isPartOf":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/"},"author":{"name":"Ross Spencer","@id":"https:\/\/exponentialdecay.co.uk\/blog\/#\/schema\/person\/4cae0a954400f42b9c1b70c699837716"},"headline":"What information is in a file format identification report?","datePublished":"2023-07-09T23:02:01+00:00","dateModified":"2025-11-28T08:20:10+00:00","mainEntityOfPage":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/"},"wordCount":434,"commentCount":5,"publisher":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/#\/schema\/person\/4cae0a954400f42b9c1b70c699837716"},"image":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#primaryimage"},"thumbnailUrl":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2023\/07\/fractal-in-detail.png","keywords":["Code4Lib","Code4Lib Journal","digipres","Digital Preservation","DROID","File Format Analysis","File Format Identification","File Formats","FileDriller","format identification","Freud","Linting","Metadata","Preservation Metadata","PRONOM","PUID","PUIDS","siegfried","Static Analysis","Technical Metadata"],"articleSection":["Archives","Digital Literacy","Digital Preservation","Publications"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/","url":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/","name":"What information is in a file format identification report?","isPartOf":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#primaryimage"},"image":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#primaryimage"},"thumbnailUrl":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2023\/07\/fractal-in-detail.png","datePublished":"2023-07-09T23:02:01+00:00","dateModified":"2025-11-28T08:20:10+00:00","description":"A deep dive into the fractal contents of file format identification reports and how they can be used for digital preservation.","breadcrumb":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#primaryimage","url":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2023\/07\/fractal-in-detail.png","contentUrl":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2023\/07\/fractal-in-detail.png","width":1293,"height":542,"caption":"Abstract from Fractal in Detail: What information is in a file format identification report from the Code4Lib Journal."},{"@type":"BreadcrumbList","@id":"https:\/\/exponentialdecay.co.uk\/blog\/published-fractal-in-detail-what-information-is-in-a-file-format-identification-report\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/exponentialdecay.co.uk\/blog\/"},{"@type":"ListItem","position":2,"name":"What information is in a file format identification report?"}]},{"@type":"WebSite","@id":"https:\/\/exponentialdecay.co.uk\/blog\/#website","url":"https:\/\/exponentialdecay.co.uk\/blog\/","name":"ross spencer :: exponentialdecay.digipres :: blog","description":"Digital preservation analyst, researcher, and software developer","publisher":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/#\/schema\/person\/4cae0a954400f42b9c1b70c699837716"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/exponentialdecay.co.uk\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/exponentialdecay.co.uk\/blog\/#\/schema\/person\/4cae0a954400f42b9c1b70c699837716","name":"Ross Spencer","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2025\/06\/avatar-scaled.png","url":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2025\/06\/avatar-scaled.png","contentUrl":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2025\/06\/avatar-scaled.png","width":2560,"height":2560,"caption":"Ross Spencer"},"logo":{"@id":"https:\/\/exponentialdecay.co.uk\/blog\/wp-content\/uploads\/2025\/06\/avatar-scaled.png"},"description":"Digital preservation domain expert and full-stack software developer.","sameAs":["http:\/\/www.exponentialdecay.co.uk\/blog","https:\/\/www.instagram.com\/b33tk33p3r\/","https:\/\/www.linkedin.com\/in\/ross-spencer-b6b9b758\/","https:\/\/x.com\/beet_keeper"],"url":"https:\/\/exponentialdecay.co.uk\/blog\/author\/exponentialdecay\/"}]}},"views":1470,"_links":{"self":[{"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/posts\/1341","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/comments?post=1341"}],"version-history":[{"count":8,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/posts\/1341\/revisions"}],"predecessor-version":[{"id":2777,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/posts\/1341\/revisions\/2777"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/media\/1344"}],"wp:attachment":[{"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/media?parent=1341"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/categories?post=1341"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/exponentialdecay.co.uk\/blog\/wp-json\/wp\/v2\/tags?post=1341"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}