General
The SharePoint Source module collects folders, files, versions, and metadata from SharePoint and creates transactions for further processing.
Purpose
The module enables structured extraction of SharePoint content into the system, supporting:
- Folder and file collection
- Version handling (latest or full history)
- Metadata enrichment
- Filtering and change detection
How It Works
The module recursively traverses a selected SharePoint folder structure.
During execution:
- Folder structures are mapped as structure articles
- Files are mapped as document articles
- Metadata is collected, including:
  - Item data
  - Folder data
  - Parent folder data
  - Timestamps
  - Version information
  - Custom attributes
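The traversal and mapping described above can be pictured with a minimal sketch. It is an illustration only, not the module's implementation: the `Node` class, the `collect` function, and the record fields are hypothetical names, and an in-memory tree stands in for SharePoint.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical in-memory stand-in for a SharePoint folder or file."""
    name: str
    is_folder: bool
    children: list["Node"] = field(default_factory=list)
    attributes: dict[str, str] = field(default_factory=dict)


def collect(node: Node, parent_path: str = "") -> list[dict]:
    """Walk the tree depth-first and emit one article record per node."""
    path = f"{parent_path}/{node.name}"
    article = {
        # Folders become structure articles, files become document articles.
        "kind": "structure" if node.is_folder else "document",
        "path": path,
        "parent": parent_path or None,  # parent folder data (None for the root)
        "attributes": dict(node.attributes),  # custom attributes
    }
    articles = [article]
    for child in node.children:
        articles.extend(collect(child, path))
    return articles


root = Node("Projects", True, [
    Node("Specs", True, [
        Node("design.docx", False, attributes={"Owner": "A. Author"}),
    ]),
    Node("readme.txt", False),
])
articles = collect(root)
```

Each record carries its own path and its parent's path, so the folder hierarchy can be rebuilt from the flat list later.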
Depending on configuration:
- Only the latest version is collected, or
- All file versions are included
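The two version modes could look like the sketch below. The `select_versions` function, the `include_all` flag, and the `(major, minor)` version tuples are assumptions made for illustration; the module's actual setting names are not given here.

```python
def select_versions(versions: list[dict], include_all: bool) -> list[dict]:
    """Return the full history or only the newest version.

    Each version is a dict with a hypothetical "number" key holding a
    (major, minor) tuple, so tuple comparison orders versions correctly.
    """
    ordered = sorted(versions, key=lambda v: v["number"])
    if include_all:
        return ordered
    # Slicing keeps the result a list and handles an empty history.
    return ordered[-1:]


history = [
    {"number": (2, 0), "label": "2.0"},
    {"number": (1, 0), "label": "1.0"},
    {"number": (1, 1), "label": "1.1"},
]
latest_only = select_versions(history, include_all=False)
full_history = select_versions(history, include_all=True)
```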
Filtering is applied to:
- Exclude specific file types
- Skip unchanged files using checksum history
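A minimal sketch of both filters follows. It assumes a SHA-256 checksum and a dictionary that maps each file path to its last collected checksum. The hash algorithm, the excluded extensions, and the `should_collect` name are all hypothetical, since none of them are specified here.

```python
import hashlib
from pathlib import PurePosixPath

# Hypothetical exclusion list; the real one would come from configuration.
EXCLUDED_EXTENSIONS = {".tmp", ".bak"}


def should_collect(path: str, content: bytes, checksums: dict[str, str]) -> bool:
    """Decide whether a file is collected, updating the checksum history."""
    if PurePosixPath(path).suffix.lower() in EXCLUDED_EXTENSIONS:
        return False  # excluded file type
    digest = hashlib.sha256(content).hexdigest()
    if checksums.get(path) == digest:
        return False  # unchanged since the last collection
    checksums[path] = digest  # remember the new state for the next run
    return True


checksums: dict[str, str] = {}
first_run = should_collect("/Projects/report.docx", b"draft 1", checksums)
second_run = should_collect("/Projects/report.docx", b"draft 1", checksums)
after_edit = should_collect("/Projects/report.docx", b"draft 2", checksums)
```

The first call collects the file, the second skips it because the content is unchanged, and the third collects it again after the content changes.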
When the collection completes:
- A transaction is created
- Logs capture progress, warnings, and errors
- The transaction is queued for further processing
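The completion step can be sketched as follows. The `Transaction` class, the `finish_collection` function, and the in-process queue are hypothetical stand-ins; the module's real transaction format and queueing mechanism are not documented here.

```python
import queue
import uuid
from dataclasses import dataclass, field


@dataclass
class Transaction:
    """Hypothetical container for one completed collection run."""
    articles: list[dict]
    log: list[str] = field(default_factory=list)
    transaction_id: str = field(default_factory=lambda: uuid.uuid4().hex)


def finish_collection(
    articles: list[dict],
    warnings: list[str],
    pending: "queue.Queue[Transaction]",
) -> Transaction:
    """Create a transaction, record log entries, and queue it."""
    tx = Transaction(articles=list(articles))
    tx.log.append(f"INFO collected {len(tx.articles)} articles")
    tx.log.extend(f"WARNING {message}" for message in warnings)
    pending.put(tx)  # hand over for further processing
    return tx


pending: "queue.Queue[Transaction]" = queue.Queue()
tx = finish_collection(
    [{"kind": "document", "path": "/Projects/readme.txt"}],
    ["skipped /Projects/cache.tmp (excluded type)"],
    pending,
)
```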
Execution Methods
The module can be executed:
- Manually using Create Transaction
- Automatically using the Task Scheduler