Testing TotalDepth With Recorded Data¶

TotalDepth is tested against a diverse data set of real world files. The test set is split into small/medium/large datasets.

Small Test Set¶

todo:	Complete this

Medium Test Set¶

The Medium Size Test Set is 20,000+ files (130Gb+) of typical oilfield data. Here is the approximate breakdown of the test set:

File Type	Files	Total Size	Notes
BIT	~500	~1.5Gb	Largest file is around 6Mb.
LAS v1.2	~500	~1Gb	Largest file is around 16Mb.
LAS v2.0	~20,000	~30Gb	Largest file is around 250Mb (RP66V1 converted files are much larger).
LAS v3.0	A few	~0	Rarely present, their absence is not considered significant.
LIS	~2000	~2GB	Largest file is around 60Mb. Around half have TIF markers.
DLIS (RP66V1)	~800	~100GB	Largest file is around 4GB. About one quarter are corrupted by TIF markers.
DLIS (RP66V2)	0	0	Not present, their absence is not considered significant.
Other	Various	Various	Various files such as PDF, TIFF, miscellaneous binary files and unstructured ASCII. If present then their contents is not considered significant but archives containing these should be processed without drama.

The layout of the test set is typical of an oilfield repository, typically by well, with a well having an unspecified directory structure and a mix of file types in each directory.

The aim is that TotalDepth can process 97+% of the files in this archive in the formats that TotalDepth supports.

Large Test Set¶

todo:	Complete this

Synthetic Test Set¶

TotalDepth can generate arbitrary sized files in a number of formats and this test set is used for performance analysis. This test set is not distributed with TotalDepth but the means to create it is.