Version 28 (modified by 6 years ago) ( diff ) | ,
---|
GRASS 8 planning: ideas
For now just a brainstorming zone...
General
Naming of GRASS Database
- GISDBASE versus grassdata versus GRASS Database versus GRASS Database Directory versus GRASS GIS Database versus GRASS GIS Spatial Database versus GIS Data Directory
- The Data tab (Data Catalog), avoids it altogether (in 7.4) and says "GRASS locations in /abs/path/here"
- Problem 1: It is hard to understand for beginners that the messing with the data in "grassdata" is not useful and can be harmful.
- Problem 2: Non-unix users confuse current working directory with "grassdata".
- Problem 3: The same things is referred to in different ways (GISDBASE, grassdata, GRASS Database Directory, dbase).
- Problem 4: Instructions often say "set your location and mapset" while they mean "set 'database directory', then location, and then mapset, and optionally also working directory."
- Problem 5: When setting up "GRASS session" manually, e.g. in Python, GISBASE and GISDBASE look too much alike.
- Solution: Make it clear that users are dealing with a database and while also emphasizing that the database is spatial as opposed to just attributes (as in GIS vector/spreadsheet tables).
- Outcome: Consistent naming of "grassdata" in doc, GUI, API (measurable). Clear way of creating instructions for beginners. Improved understating of the concept (long-term).
- Challenges: name length, strong legacy ("GISDBASE" on many places), backwards compatibility (API changes), convention (such as grassdata) versus naming, different internal naming, relation to location and mapset, GRASS versus GRASS GIS, lowercase letters (as in thing, concept) versus uppercase (as in file format)
- Consistency is perhaps the main thing needed:
- GISDBASE is an environment variable and an abbreviated form of "GRASS GIS Database",
- "dbase=" is an argument to change GISDBASE in several modules (e.g. g.mapset, r.reproject, etc),
- "grassdata" is a commonly used name of a directory that serves as a GRASS GIS Database. However, a person can have more than one GIS Database or may want to give it a different name. That should be OK.
- The modules that change GISDBASE should have arguments of "gisdbase=", not "dbase=". That would alleviate considerable confusion
- When referring to the GIS Database in the interface, we should use the term "GIS Database" (or something similar) consistently. Again, consistency can help with confusion here.
Naming of Location
- replace the term location (Location, LOCATION)
- See wiki:wxGUIDevelopment/New_Startup and grass-dev: a proposal to rename location.
- Related: #2681 Remove legacy meaning of LOCATION variable
grass executable
- Refer to it as
grass
in the documentation to avoid rewritinggrass6
tograss7
tograss8
(or even with minor versions). Request/require packagers to creategrass
command (they already often do that). - Use the general "Linux" (POSIX+GNU) standard for options/flags/parameters (of the executable).
- For example, use
-c
and--create
rather than-create
, i.e. avoid long flag to have one dash and provide both alternatives. - Supporting current/legacy syntax like
-text
and-gui
might be too difficult (definitely remove from manual). - Same for module-like syntax which could be supported as well, e.g. supporting both
--create=EPSG:3358
andcreate=EPSG:3358
. - Require the order of arguments/operands to be correct (now not enforced).
- Only exception to the rule should be
--help
. - Possibly use one of Python libraries (argparse >2.7 and >3.2, https://docs.python.org/3.6/library/optparse.html in 2.6 and 3.8? but deprecated since 2.7, getopt not recommended for general use).
- For example, use
Raster library
- organization of raster file storage layout: have one raster folder per map like for vector data or raster3D
- for fp maps: move fcell to cell, eliminating empty cell file
- implement GRASS_RASTER_TMPDIR_MAPSET like it exists for vector data (GRASS_VECTOR_TMPDIR_MAPSET), i.e. change all .tmp/ to variable in source code in init/grass.py, gis/open.c, gis/file_name.c, raster library
- maintain history as it is already for vector maps
- synchronize metadata between raster and vector maps
- keep track of open raster maps (already done in the R structure)
- Storage in tiles instead of by row (See Grass7/RasterLib)
- Merge NULL file into main data array (See Grass7/RasterLib)
- save more raster metadata like number of non-null cells, mean and stddev (see GDAL)
- MASK: add support for writing raster data (not only reading)
- add tile support for better large map support (Sentinel, global data, ...), supporting massive parallel computations (based on discussions from Aug 2017)
- it must be clear why this would better than GDAL vrt combined with r.external
- tiles could then be even stored on different nodes for speed optimization
- storage implementation:
- develop a new virtual raster mapset "VRT" (special like PERMANENT)
- virtual map: combination between current groups/stds and external maps - map metadata link the other maps, possibly in other mapset (Vaclav: I suggest this (vrt map) rather than vrt mapset or low level (i.e. new format) tiling)
- something like the segment library opens the appropriate "standard" raster maps
- different storage nodes possible if maps are different mapsets which are on diff nodes
- reading:
- [the above options or] add tile support deeply into raster lib (Rast_get_row())
- use name scheme? make use of segment library
- problem: due to row compression always whole row is read even if computation region is smaller
- writing:
- it is more complex
Imagery library
- get rid of subgroups. No real need for that... (but overcomplication of usage)
- or strengthen subgroups. One of options could be auto creating some "magic" subgroups as i.e. "_all" - all raster maps in a group (aka if subgroup is not set, use all maps from group); "_initial" - only imported remote sensing layers (equals "_all" if no new maps are added later); "RGB" - if source is RGB or RGB can be defined (I would love to be able to have an option to choose a group in d.rgb and get RGB subgroup selected as a default instead of searching for separate channels); "MLC" subgroup - see #2483.
Vector library
- keep track of opened vector maps
- keep track of dblinks to not remove table connected to multiple vector maps
- get rid of dbf as database backend
- portability and attribute file management would be much easier if each vector map had its own SQLite database (with the potential for multiple tables) rather than a single database for all vectors and their tables (layers) for the entire mapset
- ...
Python library
- finalize/stabilize Python 3 support
- simplify the startup from Python script (i.e. less steps to start session from Python, possibly includes change in distribution/installation)
- remove deprecated Python functions: https://trac.osgeo.org/grass/changeset/67669
GUI
- review (again) startup window, setting database/location/mapset, initial (default) location (wiki:wxGUIDevelopment/New_Startup)
- single window interface as an option (first step: moving code to controller classes and panels) (wiki:wxGUIDevelopment/SingleWindow)
- avoid the need for setting up path to packaged before imports (now we need to mix code and imports), e.g. grassgui package next to grass package
- integrated Addon and GUI toolboxes
- consistently inform user about whether or not an operation uses current computational region
Manual pages
- generate Sphinx manuals (rst converter is already in man/)
Bug reports
Blockers
defect type tickets:
- #3023
- Change default behavior of d.title to draw instead of output text
- #3055
- Revise monochromatic color tables
task type tickets:
Critical issues
No results
Further issues
- Check overview in: https://trac.osgeo.org/grass/milestone/8.0.0
Note:
See TracWiki
for help on using the wiki.