Marklogic only converts the half of my documents via the Content Processing Framework -


i use webdav server import pdfs, docs, pptxs , xlsxs database drag , drop. webdav server called "cpf", root "/" , port number "9999".

sorry german

and installed content processing framework standard configuration.

can have not needed security requirements?

for case marklogic says:

set needed permissions on root directory

when add documents database conversion, user adds documents must have needed permissions add , modify documents. if using webdav server drag-and-drop documents database, root directory of webdav server must have needed permissions.

one simple way accomplish these security requirements following:

  • create uri privilege uri configured root directory of webdav server.

  • create role has uri privilege , has default permissions of read. insert, , update role.

  • set permissions on webdav root directory role created. example, if role created named webdav, , root directory has uri /webdav/root/, run query (as privileged user) similar following:

xdmp:document-set-permissions("/webdav/root/",  ( xdmp:permission("webdav", "read"),  xdmp:permission("webdav", "insert"),  xdmp:permission("webdav", "update") ) ) 

you can check permissions following query:

xdmp:document-get-permissions("/webdav/root/") 

• grant new role (webdav in example above) user accesses webdav server.

in case don't "role" , "root directory" talking about?

but if error comes somewhere else? why have documents converted .xml files , others .xhtml files , 50% of original files ignored , not converted?

as suggested dave cassel, ran xdmp:document-properties() 1 of records had failed process. below result:

<?xml version="1.0" encoding="utf-8"?> <prop:properties xmlns:prop="http://marklogic.com/xdmp/property">   <cpf:processing-status xmlns:cpf="http://marklogic.com/cpf">done</cpf:processing-status>   <cpf:property-hash xmlns:cpf="http://marklogic.com/cpf">93bdf4b50736752e0155c8e16fd42544</cpf:property-hash>   <cpf:last-updated xmlns:cpf="http://marklogic.com/cpf">2016-07-25t11:26:13.006+02:00</cpf:last-updated>   <cpf:state xmlns:cpf="http://marklogic.com/cpf">http://marklogic.com/states/property-updated</cpf:state>   <cpf:self xmlns:cpf="http://marklogic.com/cpf">/xxx/pdfs/xxxxx.pdf</cpf:self>   <win32creationtime xmlns="urn:schemas-microsoft-com:">mon, 25 jul 2016 08:05:44 gmt</win32creationtime>   <win32lastaccesstime xmlns="urn:schemas-microsoft-com:">mon, 25 jul 2016 09:26:12 gmt</win32lastaccesstime>   <win32fileattributes xmlns="urn:schemas-microsoft-com:">00000000</win32fileattributes>   <win32lastmodifiedtime xmlns="urn:schemas-microsoft-com:">mon, 25 jul 2016 08:05:44 gmt</win32lastmodifiedtime> </prop:properties> 

cpf stores information state changes , errors in document's properties. diagnose what's going, go query console , run xdmp:document-properties() on 1 of documents did not processed. tell error is.


looking @ properties you've added, see state http://marklogic.com/states/property-updated , see set of microsoft properties. looking @ pipelines when go through admin ui's content processing install tab, state appears dead end -- ie, no other pipeline uses state starting point. have other processing in place creates microsoft properties?


Comments

Popular posts from this blog

jOOQ update returning clause with Oracle -

java - Warning equals/hashCode on @Data annotation lombok with inheritance -

java - BasicPathUsageException: Cannot join to attribute of basic type -