W DLC-M01-001 · Collection Metadata Field Reference.docx — Word
File Home Insert Design Layout References Review View
DLC-M01-001 · Module 01 Deliverable
Collection Metadata Field Reference
Cohort
Matter

One row per source. For every collection source you're likely to see, this reference lists the metadata fields the tool outputs, what each field actually means, and how it maps to the review environment downstream. Use it when a vendor's SOW is vague, when a custodian asks what you're taking, or when opposing counsel wants to know what you have.

How to use this reference For each source in scope on your matter, confirm the vendor's proposed toolset appears in the "Common Tools" column, and that the required fields appear in their deliverable. Missing fields aren't automatically a problem — but they should be a conscious decision, not an accident.

1. Corporate Email — Microsoft 365 & Exchange

Common tools: Purview eDiscovery (Premium), Veritas EV, Nuix, X1 Distributed Discovery, Reveal Collect.
Container: .PST, .OST, .MBOX, or direct API export (.MSG or .EML per item).

FieldTypePlain-English meaning & note
MessageIDStringImmutable identifier assigned by the mail server. Do not use as the review-tool DocID — assign your own UID (see DLC-M01-002).
ConversationIDStringGroups messages in a thread. Feeds inclusive-email threading downstream. If missing, threading gets expensive.
DateSentDateTimeClient-side send time. Normalize to UTC. Watch DST edges.
DateReceivedDateTimeServer-side receive time. Often differs from DateSent; the delta matters in some matters.
From / To / Cc / BccString[]Address lists. Include display name AND SMTP address — vendors sometimes drop one.
SubjectStringVerbatim subject line. Extract without normalization; "RE:" and "FW:" carry meaning.
HasAttachmentsBooleanFlag only. Actual attachment count/names come from family fields.
AttachmentCountIntCounted post-processing after container extraction. Watch for zip-in-zip.
MD5Hash / SHA256HashStringDeduplication key. Insist on SHA-256; MD5 collisions are rare but real.
CustodianStringAssigned by collection tool, verified against interview record.
CustodianSourceStringWhich mailbox/folder within the custodian's account.
FolderPathStringFull path within the mailbox — Inbox, Sent, Deleted Items, custom folders.
ReadStatusBooleanRarely material but occasionally decisive on constructive-knowledge questions.
Importance / SensitivityStringHeader flags (Normal, High, Confidential). Sometimes evidentially significant.
DlpMatchesString[]Purview DLP labels. If present, valuable for privilege pre-triage.

2. Endpoints — Windows & macOS Workstations

Common tools: EnCase, Cellebrite Endpoint Inspector, X-Ways, Magnet AXIOM, F-Response, KAPE.
Container: Forensic image (E01 / DD / AFF4) or targeted logical collection.

FieldTypePlain-English meaning & note
FullPathStringComplete on-disk path at time of collection. Contains user context (e.g. C:\Users\jsmith\...).
FileNameStringLeaf filename with extension. Match against extension-vs-content sniffing.
ExtensionStringStated extension. Compare to content-derived type — mismatch is a signal.
MimeTypeStringContent-sniffed MIME. Preferred over Extension for routing.
FileSizeInt (bytes)Logical size. Note: sparse files can lie.
CreatedTime / ModifiedTime / AccessedTimeDateTimeMACtimes. Reliability varies — modification is the most trusted, access the least.
EntryModifiedTimeDateTimeNTFS-specific ($MFT entry time). Useful when Access has been touched by AV.
Owner / SIDStringWindows security identifier of the owning user.
SHA256HashStringCryptographic hash. Required for chain-of-custody and dedup.
DeletedFlagBooleanWhether the item was recovered from unallocated space or $Recycle.Bin.
EncryptedFlagBooleanWhether the file was encrypted at collection. Ask the vendor for their crack/re-ask strategy.
DriveLabel / VolumeStringWhere on the device the file lived (C:, D:, external, network mount).

3. Mobile Devices — iOS & Android

Common tools: Cellebrite UFED, Oxygen Detective, Magnet AXIOM, MSAB XRY, GrayKey (LE only).
Container: Filesystem image, .UFDX, .OFB, .UFDR.

FieldTypePlain-English meaning & note
DeviceModelStringMake, model, IMEI/serial. Match against the custodian's declared devices.
OSVersionStringDetermines extraction depth (iOS 17.x file-system access differs from iOS 16.x).
ExtractionMethodStringLogical / File System / Full File System / Advanced Logical. Affects what you have.
MessageThreadIDStringSMS / iMessage / RCS thread grouping.
SenderNumber / ReceiverNumberStringE.164 phone numbers. Normalize to +country format.
TimestampDateTimeDevice local time at time of send/receive. Include TZ offset.
DeletedFlagBooleanRecovered from device unallocated / SQLite journal.
ApplicationStringMessages / WhatsApp / Signal / iMessage / Telegram / Wickr / Snap.
AttachmentPathStringOn-device path to media. Preserve family relationship.
LocationLat / LocationLonFloatGeotag from EXIF or app data. Rarely on messages; occasionally on photos.

4. Ephemeral & Modern Messaging — WhatsApp, Signal, iMessage

Common tools: Cellebrite UFED, iOS Backup Extractor, Signal Desktop (with custodian password), WhatsApp Business API export.
Note: Auto-delete features may render data unavailable if preservation was not timely; document the timeline.

FieldTypePlain-English meaning & note
PlatformStringWhatsApp / Signal / iMessage / Wickr / Telegram.
ChatTypeString1:1 / Group / Broadcast / Channel.
ParticipantsString[]All members at extraction time. Historical membership may not be recoverable.
MessageIDStringPlatform-assigned. Signal is ephemeral by nature — verify persistence.
EncryptionKeyStringRequired for WhatsApp .crypt14. Handle per firm cryptography policy.
DisappearingFlagBooleanWas the message set to auto-delete? Affects the log's completeness caveat.
DisappearingSecondsIntAuto-delete window (e.g. 604800 = 7 days).
EditedFlagBooleanSome platforms surface edits. Preserve prior-version text if available.
ReactionDataJSONEmoji reactions, who and when. Often material.

5. Collaboration — Slack, Teams, Zoom Chat, Webex

Common tools: Reveal, Nuix Discover, Onna, Hanzo, native admin exports (Slack Discovery API, Purview for Teams).
Container: JSON export per channel + attachments folder.

FieldTypePlain-English meaning & note
WorkspaceIDStringSlack workspace / Teams tenant.
ChannelID / ChannelNameStringPublic / private / DM / MPIM.
ChannelPrivacyStringDetermines whether search covered it.
ThreadTSStringParent-message timestamp used as thread root.
MessageTSStringUnix-epoch send timestamp. Convert to ISO 8601 for review.
UserID / UserNameStringSender. Reconcile against Employee Directory.
MessageTextStringVerbatim, including markdown.
EditsHistoryJSON[]Prior versions if platform records them.
ReactionDataJSONReactions + reactor identities.
FileAttachmentsJSON[]Family relationship. Watch for external file links (Google Drive, Dropbox).

6. Cloud Storage — OneDrive, Google Drive, SharePoint, Box, Dropbox, S3

FieldTypePlain-English meaning & note
CloudPathStringFull path within the tenant. Owner-relative for personal drives.
SharedWithString[]Explicit shares — often overlooked; occasionally decisive on privilege.
LinkAccessStringAnyone-with-link / Domain / Restricted. Affects preservation scope.
VersionCountIntHow many prior versions the platform retained.
LastEditorStringUser who most recently touched the file.
FileIDStringPlatform-immutable ID. Survives renames.
GoogleDocsIDStringNative Google-format ID (Docs / Sheets / Slides). Export handling differs from binary files.

7. Social & Public — LinkedIn, X, Facebook, Instagram DMs

FieldTypePlain-English meaning & note
PlatformStringWhich service.
AccountHandleStringVerify handle-to-custodian mapping in interview.
PostID / MessageIDStringPlatform-specific identifier.
PostTypeStringPost / Story / Reel / DM / Comment.
PublishedTime / EditedTimeDateTimeWatch for edits between initial post and collection.
VisibilityStringPublic / Connections / Private list.
MediaURLStringWhere the platform hosted the media at collection.
EngagementMetricsJSONLikes / reshares / comments. Rarely evidentially useful, occasionally revealing on reach.

8. Network Shares & File Servers

FieldTypePlain-English meaning & note
UNCPathString\\\\server\\share\\path.
ShareNameStringDepartmental share designation.
ACL / PermissionsString[]Who could access. Feeds constructive-knowledge analysis.
LastAccessedByStringIf server logging captured it.
Practitioner note Not every field will appear in every vendor's deliverable — and not every field is needed for every matter. The purpose of this reference is to make the absence of a field a conscious decision. If a vendor says "we don't provide that," ask why, and whether it's a tool limitation or a workflow shortcut.