Skip to content

v5.4.0

Choose a tag to compare

@marklogic-builder marklogic-builder released this 26 Feb 22:38
· 1947 commits to master since this release

Release Notes for Data Hub 5.4.0

Release Summary

Data Hub 5.4 introduces major features and improvements in these areas:

Hub Central

  • Load
    Users can load CSV, XML, JSON, binary or text files with model-based faceting/sorting, including source type and source name facets
  • Curation
    Users can map fields in source files to entity properties, including structured types (e.g. Address, which consists of Street, City, State)
    Users can run any type of step, and view custom step settings
  • Curation: Smart Mastering - Users can now configure match and merge steps in Hub Central using a simple, graphical UI:
    Add thresholds and define match rulesets, and adjust with a slider
    Define merge strategies and add merge rules
    View possible combinations of Matched Rulesets
    Change match configuration settings and merge step settings
  • Modeling - Users can create, edit, delete and save entity types and properties
    Create simple (int, string, etc.) or deeply-nested structured properties, including multi-value arrays
    Create relationship property types, and link them at the entity type level
    Denote when properties have multiple values of their type
    Enable sorting and faceting for properties
  • Data Exploration - Users can now search and explore with raw data in addition to curated data.
    Explore raw and curated data with model-based faceting/sorting
    Search and explore raw and curated data via table and snippet views
    Facet and filter data on properties defined in the model
    Save search queries
    Export searches in CSV format directly via the UI

Other Improvements

  • Smart Mastering with Gradle
  • Single value structured property matching (select 1 property)
  • Single-value structured property matching (select multiple properties)
  • Multi-value structured property matching
  • Multi-value with multiple structured type matching
Step Flexibility
  • Custom steps can now process more than URIs. You can now set sourceQuery to a script that retrieves items of different data types to pass to your custom step module.
  • New step interceptors allow you to manipulate processed documents before they are persisted. They can be configured for any step and run in the same step transaction. One application of this is adding permissions based on document contents.

Other Updates

  • Users can now add source type and name metadata while loading data to indicate source systems
  • Users can now disable auto-generation of TDEs
  • New Gradle tasks for clearing user artifacts and modules, and describing users and roles
  • Users now have the ability exclude processed items when running a step

Also includes enhancements and bug fixes, for more details check release notes.

Full Release Notes

Package Key Method
marklogic-datahub-5.4.0.war fd8fc0fdc5fd97d289f9aba89eccc39becb82c11 SHA1SUM
marklogic-data-hub-5.4.0-client.jar 28ec7f8061fefc4212f84f4af96f9d7c7d8f2070 SHA1SUM
marklogic-data-hub-central-5.4.0.war f03fbe9c56cbed73153e9a18660e936367bb47e2 SHA1SUM