katus98/gis-vector-spark

gis-vector-spark

A framework of basic parallel models for processing geographic vector data, based on Apache Spark.

Basic GIS Models

Geo Processing Tools

  • Buffer (v1.0)
  • Clip (v1.0)
  • Convex Hull (v1.0)
  • Dissolve (v1.0)
  • Erase (v1.0)
  • Enlarge
  • Intersection (v1.0)
  • Symmetrical Difference (v1.0)
  • Union (v1.0)

Geometry Tools

  • Centroids
  • Check Validity
  • Collect
  • Delaunay Triangulation
  • Densify
  • Extract Vertices
  • Geometry Calculator
  • Lines To Polygons
  • Multi Part To Single Parts (Multi To Single) (v1.0)
  • Polygons To Lines
  • Simplify
  • Voronoi Polygons

Attribute Tools

  • Join (v1.0)
  • Selector
    • Numerical Selector (v1.0)
    • Text Selector (v1.0)
  • Statistics (v1.0)
  • Unique Values

Research Tools

  • Create Grids
  • Random Points
    • Random Points In Extent
    • Random Points Inside Polygons
  • Random Selection (v1.0)
  • Regular Points

Analysis Tools

  • Distance Matrix
  • Feature Spatial Count
  • Line Intersections
  • Mean Coordinates
  • Sum Line Lengths
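As a rough illustration of what two of the simpler analysis tools compute, the sketch below implements mean coordinates and summed line lengths on a single machine in plain Python. It is not the framework's Spark implementation: the function names `mean_coordinates` and `sum_line_lengths`, and the use of coordinate tuples instead of geometry objects, are assumptions made only for this example.

```python
import math


def mean_coordinates(points):
    """Return the arithmetic mean (x, y) of a non-empty list of (x, y) points."""
    if not points:
        raise ValueError("at least one point is required")
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    return (mean_x, mean_y)


def line_length(line):
    """Return the planar length of one line given as a list of (x, y) vertices."""
    # Sum the Euclidean distance between each pair of consecutive vertices.
    return sum(math.dist(a, b) for a, b in zip(line, line[1:]))


def sum_line_lengths(lines):
    """Return the total planar length of several lines."""
    return sum(line_length(line) for line in lines)


# Hypothetical sample data in a projected (planar) coordinate system.
square_corners = [(0, 0), (2, 0), (2, 2), (0, 2)]
sample_lines = [
    [(0, 0), (3, 4)],          # one segment of length 5
    [(0, 0), (0, 1), (1, 1)],  # two segments of length 1 each
]

center = mean_coordinates(square_corners)
total = sum_line_lengths(sample_lines)
```

Both functions are simple aggregations (sums and counts), which is the kind of computation that can be split across partitions and combined afterwards in a parallel setting. The lengths here are planar, so they are only meaningful for projected coordinates.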

Data Management Tools

  • Merge (v1.0)
  • Project (v1.1)
  • Spatial Join (v1.0)
  • Split

GIS Vector Data IO

Data Input

  • ESRI Shapefile (v1.0)
  • File / Personal Geodatabase (v1.2, requires GDAL)
  • Relational Database
    • MySQL Database (v1.1)
    • PostgreSQL Database (v1.1)
    • PostgreSQL Database With Citus (v1.2)
  • Text File (v1.0)

Data Output

  • ESRI Shapefile
  • Relational Database
    • MySQL Database
    • PostgreSQL Database
  • Text File (v1.0)

User Manual

Examples of program arguments

# Buffer
-output /D:/Data/buffer.csv
-input postgresql:table
-distance 50
-geometryFields WKT
-crsUnit 3857
-serialField _id

# Clip
-output /D:/Data/clip.csv
-needHeader true
-crs 4326
-input1 /D:/Data/target.shp
-hasHeader1 false
-isWkt1 true
-geometryFields1 -1
-geometryType1 LineString
-separator1 \t
-crs1 4326
-charset1 UTF-8
-input2 /D:/Data/extent.shp
-hasHeader2 false
-isWkt2 true
-geometryFields2 -1
-geometryType2 LineString
-separator2 \t
-crs2 4326
-charset2 UTF-8

# MultiToSingle
-output /D:/Data/mts.csv
-needHeader true
-crs 4326
-input /D:/Data/target.shp
-ratio 0.5
-hasHeader false
-isWkt true
-geometryFields -1
-geometryType LineString
-separator \t
-crs 4326
-charset UTF-8
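
The argument lists above all follow a `-name value` pattern. The sketch below shows one way such a list could be read into a dictionary. It is an illustration only and not the framework's own argument parser: `parse_args` and `parse_block` are hypothetical helper names, and the rule that tokens are consumed strictly in pairs is an assumption, chosen so that values beginning with `-` (such as `-geometryFields -1`) are not mistaken for option names.

```python
def parse_args(tokens):
    """Pair each '-name' token with the token that follows it.

    Tokens are consumed strictly two at a time, so a value such as '-1'
    is kept as a value. If a name repeats, the later value wins.
    """
    if len(tokens) % 2 != 0:
        raise ValueError("expected '-name value' pairs, got an odd token count")
    options = {}
    for name, value in zip(tokens[0::2], tokens[1::2]):
        if not name.startswith("-") or len(name) < 2:
            raise ValueError(f"expected an option name, got {name!r}")
        options[name[1:]] = value
    return options


def parse_block(text):
    """Parse a multi-line argument listing, skipping blank and '#' lines."""
    tokens = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        tokens.extend(line.split())
    return parse_args(tokens)


# The Buffer example from this manual, parsed into a dictionary of strings.
buffer_args = parse_block("""
# Buffer
-output /D:/Data/buffer.csv
-input postgresql:table
-distance 50
-geometryFields WKT
-crsUnit 3857
-serialField _id
""")

# A value that starts with '-' is still treated as a value.
text_args = parse_args(["-geometryFields", "-1", "-separator", r"\t"])
```

All values are kept as strings here; converting `-distance` to a number or `-separator` to an actual tab character would be up to the caller.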

Contributors

Project Manager

  • SUN Katus

Model Authors

  • SUN Katus
  • WANG Mengxiao
