Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-11531

Add mysql-style LIMIT support to Hive, or improve ROW_NUMBER performance-wise

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.0.0
    • CBO

    Description

      For any UIs that involve pagination, it is useful to issue queries in the form SELECT ... LIMIT X,Y where X,Y are coordinates inside the result to be paginated (which can be extremely large by itself). At present, ROW_NUMBER can be used to achieve this effect, but optimizations for LIMIT such as TopN in ReduceSink do not apply to ROW_NUMBER. We can add first class support for "skip" to existing limit, or improve ROW_NUMBER for better performance

      Attachments

        1. HIVE-11531.02.patch
          214 kB
          Hui Zheng
        2. HIVE-11531.03.patch
          216 kB
          Hui Zheng
        3. HIVE-11531.04.patch
          216 kB
          Sergey Shelukhin
        4. HIVE-11531.05.patch
          214 kB
          Hui Zheng
        5. HIVE-11531.06.patch
          214 kB
          Sergey Shelukhin
        6. HIVE-11531.07.patch
          214 kB
          Hui Zheng
        7. HIVE-11531.patch
          228 kB
          Hui Zheng
        8. HIVE-11531.WIP.1.patch
          18 kB
          Hui Zheng
        9. HIVE-11531.WIP.2.patch
          49 kB
          Hui Zheng

        Issue Links

          Activity

            People

              huizane Hui Zheng
              sershe Sergey Shelukhin
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: