MarkLogic only converts half of my documents via the Content Processing Framework
I use a WebDAV server to import PDFs, DOCs, PPTXs, and XLSXs into the database by drag and drop. The WebDAV server is called "cpf", with root "/" and port number "9999".
I have also installed the Content Processing Framework with the standard configuration.
Could it be that I do not meet the needed security requirements?
For this case, the MarkLogic documentation says:
Set the needed permissions on the root directory
When you add documents to a database for conversion, the user who adds the documents must have the needed permissions to add and modify documents. If you are using a WebDAV server to drag and drop documents into the database, the root directory of the WebDAV server must have the needed permissions.
One simple way to accomplish these security requirements is the following:
• Create a URI privilege for the URI configured as the root directory of the WebDAV server.
• Create a role that has the URI privilege and has default permissions of read, insert, and update for the role.
• Set the permissions on the WebDAV root directory for the role you created. For example, if the role you created is named webdav, and the root directory has the URI /webdav/root/, run a query (as a privileged user) similar to the following:
xdmp:document-set-permissions("/webdav/root/", (
    xdmp:permission("webdav", "read"),
    xdmp:permission("webdav", "insert"),
    xdmp:permission("webdav", "update") ) )
You can then check the permissions with the following query:
xdmp:document-get-permissions("/webdav/root/")
• Grant the new role (webdav in the example above) to the user who accesses the WebDAV server.
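The privilege, role, and grant steps above could be scripted roughly as follows. This is only a sketch, assuming it is run in Query Console against the Security database by an administrative user. The role name webdav and the URI /webdav/root/ come from the documentation's example; the privilege name webdav-root-uri and the user name webdav-user are hypothetical placeholders. The role is created first, in its own transaction, because the later calls refer to it by name.

```xquery
xquery version "1.0-ml";
import module namespace sec = "http://marklogic.com/xdmp/security"
    at "/MarkLogic/security.xqy";

(: Transaction 1: create the role so later statements can reference it. :)
sec:create-role("webdav", "Role for WebDAV uploads (example)", (), (), ());

xquery version "1.0-ml";
import module namespace sec = "http://marklogic.com/xdmp/security"
    at "/MarkLogic/security.xqy";

(: Transaction 2: URI privilege, default permissions, and the grant. :)
(
  (: "webdav-root-uri" is a hypothetical privilege name. :)
  sec:create-privilege("webdav-root-uri", "/webdav/root/", "uri", "webdav"),
  sec:role-set-default-permissions("webdav", (
    xdmp:permission("webdav", "read"),
    xdmp:permission("webdav", "insert"),
    xdmp:permission("webdav", "update")
  )),
  (: "webdav-user" is a hypothetical user; use the account that connects to the WebDAV server. :)
  sec:user-add-roles("webdav-user", "webdav")
)
```

The same steps can also be done by hand in the Admin UI; the script form is just easier to repeat across environments.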
In my case, I don't know which "role" and "root directory" they are talking about.
But what if the error comes from somewhere else? Why are some documents converted to .xml files and others to .xhtml files, while 50% of the original files are ignored and not converted?
As suggested by Dave Cassel, I ran xdmp:document-properties()
on one of the records that had failed to process. Below is the result:
<?xml version="1.0" encoding="utf-8"?>
<prop:properties xmlns:prop="http://marklogic.com/xdmp/property">
  <cpf:processing-status xmlns:cpf="http://marklogic.com/cpf">done</cpf:processing-status>
  <cpf:property-hash xmlns:cpf="http://marklogic.com/cpf">93bdf4b50736752e0155c8e16fd42544</cpf:property-hash>
  <cpf:last-updated xmlns:cpf="http://marklogic.com/cpf">2016-07-25T11:26:13.006+02:00</cpf:last-updated>
  <cpf:state xmlns:cpf="http://marklogic.com/cpf">http://marklogic.com/states/property-updated</cpf:state>
  <cpf:self xmlns:cpf="http://marklogic.com/cpf">/xxx/pdfs/xxxxx.pdf</cpf:self>
  <Win32CreationTime xmlns="urn:schemas-microsoft-com:">Mon, 25 Jul 2016 08:05:44 GMT</Win32CreationTime>
  <Win32LastAccessTime xmlns="urn:schemas-microsoft-com:">Mon, 25 Jul 2016 09:26:12 GMT</Win32LastAccessTime>
  <Win32FileAttributes xmlns="urn:schemas-microsoft-com:">00000000</Win32FileAttributes>
  <Win32LastModifiedTime xmlns="urn:schemas-microsoft-com:">Mon, 25 Jul 2016 08:05:44 GMT</Win32LastModifiedTime>
</prop:properties>
CPF stores information about state changes and errors in the document's properties. To diagnose what's going on, go to Query Console and run xdmp:document-properties() on one of the documents that did not get processed. That will tell you what the error is.
Looking at the properties you've added, I see the state is http://marklogic.com/states/property-updated
and I see a set of Microsoft properties. Looking at the pipelines when I go through the Admin UI's Content Processing install tab, that state appears to be a dead end, i.e., no other pipeline uses that state as a starting point. Do you have some other processing in place that creates the Microsoft properties?
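As a rough illustration of that check, the query below pulls only the CPF-related properties for a single document. The URI is a hypothetical placeholder; cpf:state and cpf:processing-status appear in the properties output posted in the question, while cpf:error is, as far as I know, where CPF records a failure, so treat that element name as an assumption.

```xquery
xquery version "1.0-ml";
declare namespace prop = "http://marklogic.com/xdmp/property";
declare namespace cpf  = "http://marklogic.com/cpf";

(: Hypothetical URI: replace with a document that was not converted. :)
let $uri := "/path/to/unconverted.pdf"
let $props := xdmp:document-properties($uri)/prop:properties
return (
  (: The state CPF last moved the document into. :)
  fn:string($props/cpf:state),
  (: Whether CPF considers processing of that state finished. :)
  fn:string($props/cpf:processing-status),
  (: Assumed element: present only if a pipeline action failed. :)
  $props/cpf:error
)
```

An empty cpf:error together with a processing status of done suggests the document is sitting in a state that no pipeline picks up, rather than failing outright.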