Best practice for bulk uploads
From KnowledgeTree Community
Here are a few best-practices for bulk uploads into KnowledgeTree:
- 1) Disable document indexing while uploading. You may use the Indexer Diagnostic Plugin to perform Indexing at a later stage: http://forge.knowledgetree.com/projects/inddiagnostic/
- To disable indexing:
- a) Browse to "Administration » Miscellaneous » Plugins Management"
- b) Deselect the "Full-text Content Indexing" plugin
- c) Press update.
- To disable indexing:
- 2) Use the bulk-import command line tool available on our community forge: (link not working) http://forge.knowledgetree.com/projects/cmdbulkimp/
The bulk-import command line tool writes document information directly into the database. You will need to perform indexing at a later stage (see Indexer Diagnostic Plugin above).
Note that this is a community project and we do not support its use. We recommend thoroughly testing the tool before using it in a production environment.
- 3) Upload your documents in smaller segments rather than all at once. E.g. If you have 100 000 documents to migrate, try uploading 10 000 at a time. That way, if one of the uploads fails, you won't need to start from scratch. It is often worth starting the migration process by testing bulk uploads of different sizes, to find the optimal number for your particular network/server configuration.
Supported compression formats
KnowledgeTree supports the following compression and archive formats: Zip (.zip) Unix archiver (.ar) GZip or GNU Zip (.gz) Tape Archive (.tar) Tape archive and gzip - used together (.tar.gz and .tgz) .deb (generally used for Debian packages)
Note: Although Bzip2 (.bz2) is supported, this requires a php library that isn't currently in the stack, but may be in included in future releases.
Supported extensions include:
tgz
tar
gz
zip
deb
ar
tar.gz