Hive / HIVE-9974

Sensitive data redaction: data appears in name of mapreduce job


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 1.3.0, 2.0.0
    • Component/s: None
    • Labels: None

    Description

      I set up a cluster, configured a redaction rule to redact "B0096EZHM2", and ran Hive queries on it.

      Looking at the YARN RM web UI and Job History Server web UI, I see that the mapreduce jobs spawned by the Hive queries have the sensitive data ("B0096EZHM2") showing in the job names:

      e.g., "select product, useri...product='B0096EZHM2'(Stage"
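
The leak described above can be reproduced in miniature. The following is only a sketch under stated assumptions, not Hive's actual code and not the contents of the attached patch: the class, the method names, the 50-character limit, the "(Stage-1)" suffix, and the sample query are all hypothetical. It assumes a job name is built by abbreviating the query text to its head and tail, and shows that the sensitive literal survives unless the redaction rule is applied to the query before the name is built.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; none of these names come from Hive.
class JobNameRedactionSketch {
    // Assumed redaction rule: replace the sensitive literal with a placeholder.
    static final Pattern RULE = Pattern.compile(Pattern.quote("B0096EZHM2"));
    // Assumed maximum length of the abbreviated query inside a job name.
    static final int MAX_LEN = 50;

    // Apply the redaction rule to the full query text.
    static String redact(String query) {
        return RULE.matcher(query).replaceAll("REDACTED");
    }

    // Keep the head and tail of the text, joined by "...", when it is too long.
    static String abbreviate(String text, int max) {
        if (text.length() <= max) {
            return text;
        }
        int keep = (max - 3) / 2;
        return text.substring(0, keep) + "..." + text.substring(text.length() - keep);
    }

    // Build a job name from the query, optionally redacting it first.
    static String jobName(String query, boolean redactFirst) {
        String text = redactFirst ? redact(query) : query;
        return abbreviate(text, MAX_LEN) + "(Stage-1)";
    }

    public static void main(String[] args) {
        // Invented query; it is not the reporter's original statement.
        String q = "SELECT name FROM items WHERE category = 'books' AND product = 'B0096EZHM2'";
        System.out.println(jobName(q, false)); // built from the raw query
        System.out.println(jobName(q, true));  // built from the redacted query
    }
}
```

In this sketch the redaction runs before the abbreviation on purpose: if the text were truncated first, the cut could split the sensitive literal and the rule would no longer match the remaining fragment.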

      Attachments

        1. HIVE-9974.1.patch
          0.9 kB
          Sergio Peña

            People

              Assignee: Sergio Peña (spena)
              Reporter: Sergio Peña (spena)