crawlserv++
[under development]
Application for crawling and analyzing textual content of websites.
crawlservpp::Query
Namespace for classes handling queries.
Classes
class Container
    Query container.
class JsonPath
    Implements a JSONPath query using the jsoncons library.
class JsonPointer
    Implements an extended JSONPointer query using the RapidJSON library.
class RegEx
    Implements a RegEx query using the PCRE2 library.
class XPath
    Implements an XPath query using the pugixml library.
Constants
constexpr auto pcre2ErrorBufferLength {1024}
    The length of the error buffer used by the PCRE2 library, in bytes.
constexpr auto bitmaskTopBit {0x80}
    Bit mask to extract the top bit of a byte in a multibyte character.
constexpr auto bitmaskTopTwoBits {0xc0}
    Bit mask to extract the top two bits of a byte in a multibyte character.
constexpr std::string_view cDataHead {"<![CDATA["}
    The beginning of a CDATA section.
constexpr std::string_view cDataTail {"]]>"}
    The end of a CDATA section.
constexpr auto bitmaskTopBit {0x80} [inline]
Bit mask to extract the top bit of a byte in a multibyte character.
Referenced by crawlservpp::Query::RegEx::getAll().

constexpr auto bitmaskTopTwoBits {0xc0} [inline]
Bit mask to extract the top two bits of a byte in a multibyte character.
Referenced by crawlservpp::Query::RegEx::getAll().
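
The two bit masks can be combined to recognize UTF-8 continuation bytes, which always have the bit pattern 10xxxxxx. The following is a minimal sketch of that technique, not the code of RegEx::getAll(): the helper names isContinuationByte and nextCharacterOffset are hypothetical, and the constants are copied here only to keep the example self-contained.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>

// Values copied from the constants documented on this page.
constexpr auto bitmaskTopBit{0x80};
constexpr auto bitmaskTopTwoBits{0xc0};

// Hypothetical helper: checks whether a byte is a UTF-8 continuation
// byte, i.e. whether its top two bits are exactly 10.
constexpr bool isContinuationByte(char byte) {
    return (static_cast<unsigned char>(byte) & bitmaskTopTwoBits) == bitmaskTopBit;
}

// Hypothetical helper: returns the offset of the first byte of the
// character that follows the character starting at `offset`.
inline std::size_t nextCharacterOffset(std::string_view text, std::size_t offset) {
    assert(offset < text.size());

    // Skip the leading byte, then every continuation byte after it.
    ++offset;

    while (offset < text.size() && isContinuationByte(text[offset])) {
        ++offset;
    }

    return offset;
}
```

Stepping by whole characters instead of single bytes is one way to avoid restarting a search in the middle of a multibyte character, for example after an empty match.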
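constexpr std::string_view cDataHead {"<![CDATA["} [inline]
The beginning of a CDATA section.
Referenced by crawlservpp::Query::XPath::getSubSets().

constexpr std::string_view cDataTail {"]]>"} [inline]
The end of a CDATA section.
Referenced by crawlservpp::Query::XPath::getSubSets().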
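
As an illustration of what the two CDATA delimiters enclose, the sketch below removes them from a string that is fully wrapped in a CDATA section. It is an assumption about a typical use, not the implementation of XPath::getSubSets(); the function name stripCData is hypothetical, and the constants are copied here only to keep the example self-contained.

```cpp
#include <string_view>

// Values copied from the constants documented on this page.
constexpr std::string_view cDataHead{"<![CDATA["};
constexpr std::string_view cDataTail{"]]>"};

// Hypothetical helper: returns the content without its CDATA wrapper,
// or the content unchanged if it is not wrapped in a CDATA section.
constexpr std::string_view stripCData(std::string_view content) {
    const bool wrapped{
        content.size() >= cDataHead.size() + cDataTail.size()
        && content.substr(0, cDataHead.size()) == cDataHead
        && content.substr(content.size() - cDataTail.size()) == cDataTail
    };

    if (!wrapped) {
        return content;
    }

    content.remove_prefix(cDataHead.size());
    content.remove_suffix(cDataTail.size());

    return content;
}

// Compile-time checks of the behaviour described above.
static_assert(stripCData("<![CDATA[a < b]]>") == "a < b");
static_assert(stripCData("plain text") == "plain text");
```

Because the function returns a std::string_view into the original buffer, the caller has to keep that buffer alive for as long as the result is used.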
constexpr auto pcre2ErrorBufferLength {1024} [inline]
The length of the error buffer used by the PCRE2 library, in bytes.
Referenced by crawlservpp::Query::RegEx::getAll(), crawlservpp::Query::RegEx::getBool(), crawlservpp::Query::RegEx::getFirst(), and crawlservpp::Query::RegEx::RegEx().