Skip to main content

Notice

The new RDA web platform is still being rolled out. Existing RDA members PLEASE REACTIVATE YOUR ACCOUNT using this link: https://rda-login.wicketcloud.com/users/confirmation. Please report bugs, broken links and provide your feedback using the UserSnap tool on the bottom right corner of each page. Stay updated about the web site milestones at https://www.rd-alliance.org/rda-web-platform-upcoming-features-and-functionalities/.

Array Database assessment Working Group meeting at Plenary 8

  • Creator
    Discussion
  • #137810

    Peter Baumann
    Participant

    This page documents the inaugural meeting of the Array Database assessment Working Group at Plenary 8 in Denver, USA, 2016-09-16.

    Agenda

    In order to maximise focus in getting hands-on, just two introductory presetnations were given by the chairs, then leading into a discussion on how to proceed concretely.

    Discussion

    • Q: what is the focus of this WG?
      A: Focus is strictly on arrays (grids, rasters) to be able to achieve meaningful results
    • Q: Are only array databases in focus?
      A: Anything that offers a flexible query interface is of interest, and could be called a “database system” in the widest sense. However, further array-related technology should be inspected as well to obtain comparisons (e.g., MapReduce-based systems like SciHadoop, GeoTrellis, MrGeo) – but this depends on the contributions; in other words: come and champion your favourite tool!
    • Q: what is a realistically large deployment?
      A: anything larger than server RAM qualifies (otherwise, you just load data into RAM without any need for a database system); as a rule of thumb, going beyond 10 TB should be fine.
    • Q: what kind of contributions do you seek?
      A: three types of contributors are needed in order to have comprehensive expertise on board:

      • use case providers: they bring in data and challenges (aka queries)
      • service providers: they provide storage and compute space for setting up realistically large deployments of systems
      • technology (software) providers: they provide their tools for installation and operation, as well as support.
    • In an elicitation of the interests of those present the following was articulated:
      • ETL: 2
      • SQL/MDA: 4
      • formats (HDF, …): 5
      • query power (Fourier Transform etc.): 3
      • OPeNDAP: 2
      • model-to-model communication via database: 2

    Outcome

    The following concrete initial groups shaped:

    • use case providers:
      • Mohan: Earth science
      • Schaaff: Space science
    • data center: none
    • software providers:
      • Peter Baumann: rasdaman
      • Ted Habermann: HDF
      • Mohan: HDF, OPeNDAP
      • Doug Lindholm: tbd

    Action items

    • chairs: convene group, grow it from membership via email – until beginning of Oct
    • data center group: establish infrastructure for installation – tbd
    • software providers: stand up installations, provide tutorials – once installation points are defined

     

     

     

     

     

    2016-09-16_RDA_ADWG.pdf

Log in to reply.