Geometry

namespace: geometry

The geometry module is the foundation of the entire system. It defines basic data structures and operations: points, transformation matrices, conversions between the Lie group SE(3) and the Lie algebra se(3), and so on, as well as common data structures such as point clouds and triangle meshes together with their related algorithms.

Geometry.h

In Geometry.h, the first thing you should notice is:

typedef float scalar; 

By default, float is used as the scalar type, so point clouds, matrices, etc. are all stored in single precision. The floating-point precision used by the entire library can be changed by adding -DUSING_FLOAT64 to CMAKE_CXX_FLAGS in CMakeLists.txt.
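
Presumably the header switches this typedef on that macro; a minimal sketch of the idea (the exact guard used in Geometry.h may differ):

    //Sketch only: how a USING_FLOAT64 switch is typically wired up
    #ifdef USING_FLOAT64
    typedef double scalar;  //double precision for the whole library
    #else
    typedef float scalar;   //default: single precision
    #endif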

typedef Vector6 Se3;
typedef Matrix4 SE3;

The Lie group and Lie algebra types are defined as SE3 and Se3, which are a 4x4 matrix and a 6-dimensional vector, respectively. Through the functions:

Matrix4 Se3ToSE3(const Vector6 &input);
Vector6 SE3ToSe3(const Matrix4 &input);

you can convert between the two. These functions are implemented on top of the Sophus library. (In Open3D, the conversion from the 6-dimensional vector to the 4x4 matrix leaves the translation part completely unchanged; the mathematical derivation says the two differ slightly, so the Sophus-based version should be more accurate.)
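
A minimal round-trip sketch, assuming the Matrix4/Vector6 aliases live in the geometry namespace alongside these functions:

    //Sketch: round trip between SE(3) and se(3) using the functions above
    geometry::Matrix4 T = geometry::Matrix4::Identity();
    geometry::Vector6 xi = geometry::SE3ToSe3(T);   //SE(3) -> se(3); the identity maps to the zero vector
    geometry::Matrix4 T2 = geometry::Se3ToSE3(xi);  //se(3) -> SE(3); recovers T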

struct VoxelGridHasher;
struct PixelGridHasher;

The above two are hash functors for three-dimensional and two-dimensional integer vectors; spatial hashing built on them is a very efficient data structure.
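
Such a hasher is typically used as the Hash argument of an std::unordered_map or std::unordered_set keyed by voxel or pixel coordinates. A minimal sketch, assuming VoxelGridHasher is a functor over an Eigen 3D integer vector (the exact key and return types are assumptions):

    //Sketch: hashing a voxel coordinate (assumed key type Eigen::Vector3i)
    geometry::VoxelGridHasher hasher;
    auto h = hasher(Eigen::Vector3i(1, 2, 3));  //bucket index for this voxel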

std::tuple<Point3, double, double> FitPlane(const Point3List & _points);
std::tuple<Vector2, double, double> FitLine(const Point2List & _points);

The above functions fit a plane to a set of 3D points and a line to a set of 2D points. For example, in a three-dimensional coordinate system, the equation of a plane is \(ax+by+cz+d = 0\), where \(n = (a, b, c)^T\) is the normal vector of the plane. The return type is std::tuple: the first element is the normal vector, the second is \(d\), and the third is the ratio of the second-largest singular value to the largest singular value (SVD is used to fit the plane or line, so the singular values are available). This ratio can be used as an indicator of the error between the original points and the fitted plane: if all points lie exactly on the plane, the value is 0, so the better the fit, the smaller the value.
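
A minimal usage sketch, assuming the points are already collected in a geometry::Point3List:

    //Sketch: fit a plane to a set of 3D points and unpack the result
    geometry::Point3List points;   //fill with 3D points beforehand
    geometry::Point3 normal;       //(a, b, c)
    double d, residual_ratio;
    std::tie(normal, d, residual_ratio) = geometry::FitPlane(points);
    //normal.dot(p) + d is close to 0 for points p near the fitted plane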

OnePiece uses Point3List to denote an array of Eigen::Matrix<geometry::scalar, 3, 1>; similarly, PointXList represents an array of Eigen::Matrix<scalar, Eigen::Dynamic, 1>. Also, with C++11, we can give an alias for Eigen::Matrix<geometry::scalar, T, 1> by:

    template <int T>
    using Vector = Eigen::Matrix<scalar, T, 1>;

So you can use Vector<10> to declare a 10D vector. We also use PointList to denote an array of such template vectors, so PointList<100> is an alias of std::vector<Eigen::Matrix<scalar, 100, 1>, Eigen::aligned_allocator<Eigen::Matrix<scalar, 100, 1>>>. This is very useful.
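
For example, a list of 100-dimensional descriptors could be declared like this (a sketch using only the aliases above):

    //Sketch: a container of fixed-size 100D vectors
    geometry::PointList<100> descriptors;
    descriptors.push_back(geometry::Vector<100>::Zero());  //one all-zero descriptor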

In Geometry.h, some other aliases are also defined. These aliases exist for readability; some basic knowledge of feature matching, etc., is needed to understand their meanings.

Geometry2d.h

Geometry2d.h implements some 2D geometric algorithms, mainly very basic computational-geometry routines. This part was written to support the DCEL (doubly connected edge list) in Algorithm. The main functions in this part are:

//Check if a point is inside a convex polygon
int CheckPointInConvexPoly(const geometry::Point2List &points, const Point2 &p);
//Calculate the area of a convex polygon
float ComputeAreaConvexPoly(const Point2List &points);
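
A minimal sketch of calling them on a unit square (the vertices are assumed to be given in boundary order):

    //Sketch: point-in-convex-polygon test and area of a unit square
    geometry::Point2List square;
    square.push_back(geometry::Point2(0, 0));
    square.push_back(geometry::Point2(1, 0));
    square.push_back(geometry::Point2(1, 1));
    square.push_back(geometry::Point2(0, 1));
    int inside = geometry::CheckPointInConvexPoly(square, geometry::Point2(0.5, 0.5));
    float area = geometry::ComputeAreaConvexPoly(square);  //expected: 1.0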

PointCloud.h

PointCloud.h implements some point cloud-related data structures and algorithms. The point cloud contains points, colors, and normals (the latter two are optional). The related algorithms include point cloud downsampling, normal vector estimation, and so on. Some basic member functions are listed below:

//Read PCD from PLY file
void LoadFromPLY(const std::string &filename);
//Read PCD from OBJ file
void LoadFromOBJ(const std::string &filename);
//Read PCD from color and depth
void LoadFromRGBD(const cv::Mat &rgb, const cv::Mat & depth, const camera::PinholeCamera &camera );
void LoadFromRGBD(const RGBDFrame &rgbd, const camera::PinholeCamera &camera);
//Estimate the normal vector by fitting a plane to the surrounding points of a certain point. 
//`radius` represents the search radius, and `knn` represents the number of fitting points
void EstimateNormals(float radius = 0.1, int knn = 30);
//Transform pcd
void Transform(const TransformationMatrix &T);
//Down sample 
std::shared_ptr<PointCloud> DownSample(double grid_len);
//Write pcd to PLY file
bool WriteToPLY(const std::string &fileName) const;
//Write pcd to OBJ file
bool WriteToOBJ(const std::string &fileName) const;
//Merge this pcd with another one
void MergePCD(const PointCloud & another_pcd);
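
A minimal usage sketch built only from the members listed above (file names are placeholders):

    //Sketch: load a point cloud, estimate normals, downsample, and save
    geometry::PointCloud pcd;
    pcd.LoadFromPLY("input.ply");             //placeholder file name
    pcd.EstimateNormals(0.1, 30);             //radius = 0.1, knn = 30
    auto downsampled = pcd.DownSample(0.01);  //voxel grid length = 0.01
    downsampled->WriteToPLY("output.ply");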

TriangleMesh.h

TriangleMesh.h defines the triangle mesh data structure and related algorithms. The biggest difference between a triangle mesh and a point cloud is that a mesh has edges and faces. Each face is a triangle consisting of 3 points, so the mesh also stores a face table holding the indices of the 3 points of each triangle. The current implementation does not contain textures.

//Read mesh from PLY file
void LoadFromPLY(const std::string &filename);
// Read Mesh from OBJ file
void LoadFromOBJ(const std::string &filename);
/*
Compute the normal vectors. The normals of a triangle mesh are easier to estimate than those of a point cloud:
each face already has three vertices, so the face normal follows directly from the plane through them,
and the normal of a vertex is a weighted average of the normals of all faces connected to that vertex.
*/
void ComputeNormals(float radius = 0.1, int knn = 30);
//Transform the mesh
void Transform(const TransformationMatrix &T);
//Write mesh to PLY file
bool WriteToPLY(const std::string &fileName) const;
//Write mesh to OBJ file
bool WriteToOBJ(const std::string &fileName) const;
//Simplify mesh with edge collapse based on quadric error
std::shared_ptr<geometry::TriangleMesh> QuadricSimplify(int target_num) const;
//Voxel-based uniform grid simplification, similar to point cloud downsampling
std::shared_ptr<geometry::TriangleMesh> ClusteringSimplify(float grid_len) const;
//Remove fragments whose connected points is less than a threshold
std::shared_ptr<geometry::TriangleMesh> Prune(int min_points) const;
//Remove fragments whose number of connected points is less than a threshold
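
A minimal usage sketch chaining the members above (the number passed to QuadricSimplify is arbitrary here):

    //Sketch: load a mesh, compute normals, simplify, and save
    geometry::TriangleMesh mesh;
    mesh.LoadFromPLY("input.ply");                  //placeholder file name
    mesh.ComputeNormals();
    auto simplified = mesh.QuadricSimplify(10000);  //arbitrary target_num
    simplified->WriteToOBJ("simplified.obj");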

For details of quadric simplification, refer to related_paper/quadrics.pdf.

For a comparison of the two simplification methods, see Simplification.

Ransac.h

Ransac.h mainly implements two specific algorithms that use RANSAC, built on the basic RANSAC framework GRANSAC. The advantage of RANSAC is that it repeatedly samples random subsets of points to estimate a transformation or model, so that outliers can be largely excluded. For example, when fitting a plane, directly using FitPlane from Geometry.h lets outliers strongly influence the result, whereas RANSAC yields a better fit.

//Use Ransac to estimate the relative transformation
TransformationMatrix EstimateRigidTransformationRANSAC(const PointCorrespondenceSet &correspondence_set,
    PointCorrespondenceSet & inliers, std::vector<int> &inlier_ids, int max_iteration = 2000, float threshold = 0.1);
//Use Ransac to fit plane
std::tuple<geometry::Point3, double, double> FitPlaneRANSAC(const Point3List &points, 
    Point3List & inliers, std::vector<int> &inlier_ids, int max_iteration = 2000, float threshold = 0.05);

You can get the inliers and their corresponding indices using the functions above.
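
A minimal sketch of the plane-fitting variant, using the default iteration count and threshold:

    //Sketch: robust plane fitting with RANSAC; inliers and indices are returned by reference
    geometry::Point3List points;   //fill with noisy 3D points beforehand
    geometry::Point3List inliers;
    std::vector<int> inlier_ids;
    auto plane = geometry::FitPlaneRANSAC(points, inliers, inlier_ids);
    geometry::Point3 normal = std::get<0>(plane);  //plane normal
    double d = std::get<1>(plane);                 //plane offset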

RGBDFrame.h

RGBDFrame.h defines the RGBDFrame data structure, which bundles the depth map and the color map and simplifies the calling process. This data structure mainly serves the RGBD SLAM system: it can cache extracted features, the point cloud, and so on, which are needed for tracking, so that an RGBD SLAM does not have to recompute them. Release() releases all cached content.

KDTree.h

There are two underlying implementations of KDTree, based on OpenCV and nanoflann respectively. You can choose either of them by setting the pre-defined variable NANO_IMPLEMENTATION.

The most troublesome part of calling OpenCV's KDTree is the back-and-forth conversion between Eigen and OpenCV; geometry::KDTree mainly exists to hide this.

The following is a piece of code (from the Internet) that uses kdtree in OpenCV.

#include <opencv2/core.hpp>
#include <opencv2/flann.hpp>
#include <iostream>
#include <vector>

using namespace std;

int main() {
    //Points to build tree
    vector<cv::Point2f> features = { { 1,1 },{ 2, 2},{ 3, 3},{ 4, 4},{ 2, 4} };
    cv::Mat source = cv::Mat(features).reshape(1);
    source.convertTo(source, CV_32F);

    cv::flann::KDTreeIndexParams indexParams(2);
    cv::flann::Index kdtree(source, indexParams); 

    //Parameter
    int queryNum = 3;//knn 
    vector<float> vecQuery(2);//query point
    vector<int> vecIndex(queryNum);//indices of the k nearest points
    vector<float> vecDist(queryNum);//distances
    cv::flann::SearchParams params(32);//max search depth


    vecQuery = { 3, 4};
    kdtree.knnSearch(vecQuery, vecIndex, vecDist, queryNum, params);

    cout << "vecDist: " << endl;
    for (auto&x : vecDist)
        cout << x << " ";
    cout << endl;

    cout << "vecIndex: " << endl;
    for (auto&x : vecIndex)
        cout << x << " ";

    return 0;
}

There are some points you should be careful about:

1. FLANN in OpenCV does not support double.
2. In this snippet, the type of the query point is vector<float>; if you use cv::Mat(features[0]) instead, there will be a segmentation fault.
3. You need to keep the input array alive the whole time. So if you want to separate the building process from the searching process, you need to keep a shared matrix cloned from the original matrix; OnePiece keeps a member (input_array) in the class for this.
4. knnSearch does not return any value, and you need to resize(k) the indices and distances arrays before searching. In radiusSearch you don't have to resize them; radiusSearch returns the number of points found, and you should use that returned value to iterate over the indices and distances arrays rather than indices.size(), because indices.size() equals the max_points you set beforehand, i.e. the maximum number of points you want to retrieve.
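
As an illustration of point 4, continuing the snippet above (the radius and max_points values are arbitrary):

    //radiusSearch: use the returned count, not radiusIndex.size(), to read the results
    int maxPoints = 10;
    vector<int> radiusIndex;
    vector<float> radiusDist;
    int found = kdtree.radiusSearch(vecQuery, radiusIndex, radiusDist, 1.0, maxPoints, params);
    for (int i = 0; i < found; ++i)
        cout << radiusIndex[i] << " ";
    cout << endl;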

Anyway, you don't need to worry about these problems in OnePiece, where KDTree is very simple to use. It is a template class, i.e. you specify the dimension of the data stored in the tree. When building the tree, you pass in the PointList types defined in Geometry.h directly (essentially a vector of Eigen vectors), which can be a dynamic or a static vector array; if it is a dynamic vector array, the dimension of each dynamic vector needs to equal the template dimension. In addition, RadiusSearch and KnnSearch also take Eigen vectors directly, and OnePiece makes no assumptions about the indices and distances arrays.

nanoflann

nanoflann is a very lightweight C++11 library consisting of a single hpp file. Compared with the OpenCV implementation, nanoflann is more efficient and uses less memory. In nanoflann you need to declare an adaptor class, with certain required functions, for each kind of points (such as 2D or 3D points) that needs to be searched; in OnePiece there is no need to do this. Nanoflann's radiusSearch cannot limit max_neighbors, which means it finds all points within the radius; this is unnecessary in some cases and can waste computing resources. OnePiece modified the source code of nanoflann to add this feature (all points are searched when max_neighbors = 0).

The KDTree based on the nanoflann implementation has the same usage as the one based on OpenCV. Additionally, it supports indices of type std::vector<size_t>.

    class SearchParameter
    {
        public:
        //kdtree parameter
        SearchParameter(int _checks = 256, 
            float _eps = 1e-8, bool _sorted = true)
        {
            checks = _checks;
            eps = _eps;
            sorted = _sorted;
        }
        //the indices array is sorted based on the distances
        bool sorted = true;
        //EPS
        float eps = 1e-8;
        //Max search depth
        int checks = 256;
    };
    template<int T = 3>
    class KDTree
    {
        public:
        KDTree(int _trees = 4)
        {
            trees = _trees;
        }
        //Build the KDTree from a set of high-dimensional vectors; you can pass a vector of dynamic vectors or a vector of static vectors
        void BuildTree(const geometry::PointXList &points);

        void BuildTree(const geometry::PointList<T> &points);
        //strange rules in opencv flann: 
        //when you do knnsearch, you need to resize(k) at beginning.
        //when you do radius search, you need to get the returned points_count and do a resize for the results.
        //search points within radius for a dynamic vector or static vector 
        void RadiusSearch(const geometry::VectorX &point, std::vector<int> &indices, 
            std::vector<float > &dists, double radius, int max_result, 
            const SearchParameter &sp = SearchParameter());
        void RadiusSearch(const geometry::Vector<T> &point, std::vector<int> &indices, 
            std::vector<float > &dists, double radius, int max_result, 
            const SearchParameter sp = SearchParameter());
        //search the k nearest points for a dynamic vector or static vector 
        void KnnSearch(const geometry::VectorX &point, std::vector<int> &indices, 
            std::vector<float > &dists, int k, 
            const SearchParameter &sp = SearchParameter());
        void KnnSearch(const geometry::Vector<T> &point, std::vector<int> &indices, 
            std::vector<float > &dists, int k, 
            const SearchParameter &sp = SearchParameter());

#if NANO_IMPLEMENTATION
        //only the nanoflann-based KDTree supports these overloads
        void RadiusSearch(const geometry::VectorX &point, std::vector<size_t> &indices, 
            std::vector<float > &dists, double radius, int max_result, 
            const SearchParameter &sp = SearchParameter());
        void RadiusSearch(const geometry::Vector<T> &point, std::vector<size_t> &indices, 
            std::vector<float > &dists, double radius, int max_result, 
            const SearchParameter sp = SearchParameter());
        //search the k nearest points for a dynamic vector or static vector 
        void KnnSearch(const geometry::VectorX &point, std::vector<size_t> &indices, 
            std::vector<float > &dists, int k, 
            const SearchParameter &sp = SearchParameter());
        void KnnSearch(const geometry::Vector<T> &point, std::vector<size_t> &indices, 
            std::vector<float > &dists, int k, 
            const SearchParameter &sp = SearchParameter());
#endif
    };
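
A minimal usage sketch of the wrapper, assuming geometry::Point3List is compatible with PointList<3> and geometry::Point3 with Vector<3>:

    //Sketch: build a 3D KDTree and query the 5 nearest neighbours of the origin
    geometry::Point3List points;   //fill with 3D points beforehand
    geometry::KDTree<3> tree;
    tree.BuildTree(points);

    std::vector<int> indices;
    std::vector<float> dists;
    tree.KnnSearch(geometry::Point3(0, 0, 0), indices, dists, 5);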

You can view example/GetLabelUsingKDTree.cpp to better understand how to use KDTree in OnePiece. The input is two models: an original model (without semantic information) and a reference model (with semantic information). By finding the closest reference point for each original point through the KDTree, we can obtain the original model's semantic labels.