

DIGITAL CORPORA

Producing the Digital Body





HOME


DigitalCorpora.org is a website of digital corpora for use in computer forensics
education and research. All of the disk images, memory dumps, and network packet
captures on this website are freely available and may be used without prior
authorization or IRB approval. We also maintain a research corpus of real data
acquired from around the world; use of that dataset is possible under special
arrangement.

From here you can view the available:

 * Cell Phone Dumps
 * Disk Images
 * Files
 * Network Packet Dumps
 * Scenarios

Most of the disk images are distributed in EnCase E01 format. For many of the
disk images we also provide a Digital Forensics XML (DFXML) file that describes
the files contained within each volume. Network packet captures are distributed
in PCAP format. Other files are available as well.
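
As an illustration of how a Digital Forensics XML file might be read, the sketch
below lists file names and sizes from a DFXML document. It assumes that each file
is recorded as a fileobject element with filename and filesize children, which is
the common DFXML layout but is not stated on this page. The helper names
local_name and iter_files are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Iterator, Tuple


def local_name(tag: str) -> str:
    """Strip an XML namespace prefix such as '{uri}fileobject'."""
    return tag.rsplit("}", 1)[-1]


def iter_files(dfxml_text: str) -> Iterator[Tuple[str, int]]:
    """Yield (filename, size in bytes) for each fileobject element.

    Assumption: files appear as <fileobject> elements that contain
    <filename> and <filesize> children. A missing size is reported as 0.
    """
    root = ET.fromstring(dfxml_text)
    for elem in root.iter():
        if local_name(elem.tag) != "fileobject":
            continue
        # Collect the direct children by their namespace-free tag name.
        fields = {local_name(c.tag): (c.text or "").strip() for c in elem}
        if "filename" in fields:
            yield fields["filename"], int(fields.get("filesize") or 0)
```

For a large DFXML file, a streaming parser such as ET.iterparse would avoid
loading the whole document into memory; the version above favors brevity.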


SEARCH THE CORPUS!

You can now search the corpus directly by name. The search results show up to a
thousand matching files and let you download each file directly or browse the
directory that contains it. The search form is at
https://downloads.digitalcorpora.org/search.
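
A search can also be scripted. The sketch below only builds the query URL. It
assumes the search form submits a GET request to
https://downloads.digitalcorpora.org/search with the search term in a parameter
named q, as the form markup on this page suggests. The helper name
build_search_url is hypothetical, and nothing is assumed about the format of the
results page.

```python
from urllib.parse import urlencode

# Assumed endpoint and parameter name, taken from the page's search form.
SEARCH_ENDPOINT = "https://downloads.digitalcorpora.org/search"


def build_search_url(term: str) -> str:
    """Return the GET URL for a corpus name search (hypothetical helper)."""
    term = term.strip()
    if not term:
        raise ValueError("search term must not be empty")
    # urlencode percent-encodes the term and turns spaces into '+'.
    return f"{SEARCH_ENDPOINT}?{urlencode({'q': term})}"
```

For example, build_search_url("nps-2010-emails") produces a URL that can be
opened in a browser or passed to an HTTP client.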


BROWSE THE CORPUS!

All of our site data is stored in the Amazon S3 bucket s3://digitalcorpora/. You
can download from that bucket. We recommend using the bucket directly in
Amazon’s cloud. We get free data storage and transfer from Amazon as part of the
Amazon Open Data Program, for which we are thankful!
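
Objects in a public S3 bucket can usually also be fetched over HTTPS. As a rough
sketch, the helper below converts an s3:// URI into a virtual-hosted-style URL.
The function name s3_uri_to_https is hypothetical, and the code assumes that the
bucket permits anonymous HTTPS reads at the standard
https://<bucket>.s3.amazonaws.com/<key> address, which this page does not state.

```python
from urllib.parse import quote


def s3_uri_to_https(uri: str) -> str:
    """Map s3://bucket/key to https://bucket.s3.amazonaws.com/key.

    Assumption: the bucket allows anonymous reads at the standard
    virtual-hosted-style address.
    """
    prefix = "s3://"
    if not uri.startswith(prefix):
        raise ValueError(f"not an s3:// URI: {uri!r}")
    bucket, _, key = uri[len(prefix):].partition("/")
    if not bucket:
        raise ValueError(f"missing bucket name: {uri!r}")
    # Percent-encode the key but keep '/' so the path structure survives.
    return f"https://{bucket}.s3.amazonaws.com/{quote(key)}"
```

Passing "s3://digitalcorpora/corpora/README.txt" yields a URL under
digitalcorpora.s3.amazonaws.com with the same key.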

You can browse the S3 bucket directly using our JavaScript-based browser here:
[S3 Browser]. It is fast, but it will not work with wget -r to download many
files at once.

You can also browse using our server-based S3 gateway: [S3 Gateway]. It is
written in Python, and its source code is available.

You can also access this resource from the AWS command line interface with the
s3 ls command:

$ aws s3 ls s3://digitalcorpora/corpora/
                           PRE bin/
                           PRE drives/
                           PRE drives_bulk_extractor/
                           PRE drives_dfxml/
                           PRE files/
                           PRE hashes/
                           PRE mobile/
                           PRE packets/
                           PRE ram/
                           PRE scenarios/
                           PRE sql/
2020-11-21 10:56:19         43 README.txt
2020-11-21 10:56:20    1783404 digitalcorpora.org-hashdeep-2020-04-01.csv
2020-11-21 10:56:19    1787101 digitalcorpora.org-hashdeep-2020-05-01.csv
2020-11-21 10:56:19    1794086 digitalcorpora.org-hashdeep-2020-06-01.csv
2020-11-21 10:56:19    1794914 digitalcorpora.org-hashdeep-2020-07-01.csv
2020-11-21 10:56:20    1796103 digitalcorpora.org-hashdeep-2020-08-01.csv
2020-11-21 10:56:20    1796275 digitalcorpora.org-hashdeep-2020-09-01.csv
2020-11-21 10:56:20    1796447 digitalcorpora.org-hashdeep-2020-10-01.csv
2020-11-21 10:56:20    1796619 digitalcorpora.org-hashdeep-2020-11-01.csv
$
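
To post-process a listing like the one above, the output can be parsed into
prefixes and files. This is a minimal sketch that assumes the two line shapes
shown in the transcript ("PRE name/" for prefixes, and "date time size name" for
objects). The names Entry and parse_s3_ls are hypothetical.

```python
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    name: str
    size: Optional[int]      # None for a prefix ("PRE" line)
    modified: Optional[str]  # "YYYY-MM-DD HH:MM:SS", None for a prefix


def parse_s3_ls(output: str) -> List[Entry]:
    """Parse the text printed by `aws s3 ls` into Entry records."""
    entries = []
    for line in output.splitlines():
        stripped = line.strip()
        if stripped.startswith("PRE "):
            entries.append(Entry(stripped[4:].strip(), None, None))
            continue
        # Object lines: date, time, size, then the name (may contain spaces).
        fields = stripped.split(None, 3)
        if len(fields) == 4 and fields[2].isdigit():
            date, time, size, name = fields
            entries.append(Entry(name, int(size), f"{date} {time}"))
        # Anything else (blank lines, shell prompts) is skipped.
    return entries
```

The listing text could come from subprocess.run(["aws", "s3", "ls", ...],
capture_output=True, text=True).stdout. When no AWS credentials are configured,
the AWS CLI's global --no-sign-request option sends unsigned requests, which may
be needed for a public bucket.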



PUBLICATIONS

Publications describing these corpora and our related research can be found on
our publications page.


TEACHER’S SOLUTIONS

Some of our scenarios have solutions available! In general, solutions are
restricted to:

 * Faculty members of accredited non-profit educational institutions.
 * Individuals who are employees of the US Government or US Government
   contractors who are engaged in digital forensics training or research.

In some circumstances, the teacher’s solutions will also be made available to
individuals working with foreign partners of the US government.

Solutions are distributed as PDF and ZIP files that are encrypted with a
password; they are only available to faculty at accredited educational
institutions and employees of government or law enforcement organizations that
are working as researchers or trainers.

Information on obtaining the solutions can be found here: Obtaining Solutions.


RECENT NEWS

 * New Android 11 and 12 Images! (2022-09-06 00:00:23)
 * 19 New Scenarios! (2022-07-24 00:52:01)
 * New WordPress (2022-07-24 00:18:17)
 * Downloads has been transitioned from GMU to S3! (2021-02-03 00:30:58)
 * Please try the new corpora browser (2021-01-31 13:04:03)


CITING THE CORPORA

If you are writing a research article in which you use data from this site,
please cite our paper:

Garfinkel, Farrell, Roussev and Dinolt, Bringing Science to Digital Forensics
with Standardized Forensic Corpora, DFRWS 2009, Montreal, Canada.






© 2022 Digital Corpora