crawlserv++
[under development]
Application for crawling and analyzing textual content of websites.
Namespace for classes handling queries.
Classes
| class | Container | Query container. |
| class | JsonPath | Implements a JSONPath query using the jsoncons library. |
| class | JsonPointer | Implements an extended JSONPointer query using the RapidJSON library. |
| class | RegEx | Implements a RegEx query using the PCRE2 library. |
| class | XPath | Implements an XPath query using the pugixml library. |
Constants
| constexpr auto | pcre2ErrorBufferLength {1024} | The length of the error buffer used by the PCRE2 library, in bytes. |
| constexpr auto | bitmaskTopBit {0x80} | Bit mask to extract the top (most significant) bit of a byte in a multibyte character. |
| constexpr auto | bitmaskTopTwoBits {0xc0} | Bit mask to extract the top two bits of a byte in a multibyte character. |
| constexpr std::string_view | cDataHead {"<![CDATA["} | The beginning of a CDATA section. |
| constexpr std::string_view | cDataTail {"]]>"} | The end of a CDATA section. |
| constexpr auto bitmaskTopBit {0x80} | inline |
Bit mask to extract the top (most significant) bit of a byte in a multibyte character.
Referenced by crawlservpp::Query::RegEx::getAll().
| constexpr auto bitmaskTopTwoBits {0xc0} | inline |
Bit mask to extract the top two bits of a byte in a multibyte character.
Referenced by crawlservpp::Query::RegEx::getAll().
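How RegEx::getAll() applies these masks is not documented on this page. A common use for this pair of values is detecting UTF-8 continuation bytes, whose top two bits are always `10`. The following is a minimal sketch under that assumption; `isContinuationByte` and `countCodePoints` are hypothetical helpers and not part of crawlserv++.

```cpp
#include <cstddef>
#include <string_view>

// Values copied from the constants documented above.
constexpr auto bitmaskTopBit{0x80};
constexpr auto bitmaskTopTwoBits{0xc0};

// Hypothetical helper: true if the byte has the form 10xxxxxx,
// i.e. it continues a multibyte UTF-8 sequence.
constexpr bool isContinuationByte(char c) {
    return (static_cast<unsigned char>(c) & bitmaskTopTwoBits) == bitmaskTopBit;
}

// Hypothetical helper: counts code points in (assumed valid) UTF-8
// by counting every byte that is not a continuation byte.
constexpr std::size_t countCodePoints(std::string_view utf8) {
    std::size_t count{0};

    for(const char c : utf8) {
        if(!isContinuationByte(c)) {
            ++count;
        }
    }

    return count;
}
```

Masking with `bitmaskTopTwoBits` and comparing against `bitmaskTopBit` separates continuation bytes (`10xxxxxx`) from ASCII bytes (`0xxxxxxx`) and lead bytes (`11xxxxxx`), which is why both constants are needed together.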
| constexpr std::string_view cDataHead {"<![CDATA["} | inline |
The beginning of a CDATA section.
Referenced by crawlservpp::Query::XPath::getSubSets().
| constexpr std::string_view cDataTail {"]]>"} | inline |
The end of a CDATA section.
Referenced by crawlservpp::Query::XPath::getSubSets().
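The page does not show how XPath::getSubSets() uses the two CDATA markers. One plausible use is unwrapping text that is enclosed in a single CDATA section. The sketch below illustrates that idea only; `stripCData` is a hypothetical helper and not part of crawlserv++.

```cpp
#include <string_view>

// Values copied from the constants documented above.
constexpr std::string_view cDataHead{"<![CDATA["};
constexpr std::string_view cDataTail{"]]>"};

// Hypothetical helper: returns the text between cDataHead and cDataTail,
// or the input unchanged if it is not wrapped in a CDATA section.
constexpr std::string_view stripCData(std::string_view text) {
    if(
        text.size() >= cDataHead.size() + cDataTail.size()
        && text.substr(0, cDataHead.size()) == cDataHead
        && text.substr(text.size() - cDataTail.size()) == cDataTail
    ) {
        text.remove_prefix(cDataHead.size());
        text.remove_suffix(cDataTail.size());
    }

    return text;
}
```

Because the function works on `std::string_view`, it returns a view into the caller's buffer and copies nothing; the caller has to keep the underlying string alive while the result is in use.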
| constexpr auto pcre2ErrorBufferLength {1024} | inline |
The length of the error buffer used by the PCRE2 library, in bytes.
Referenced by crawlservpp::Query::RegEx::getAll(), crawlservpp::Query::RegEx::getBool(), crawlservpp::Query::RegEx::getFirst(), and crawlservpp::Query::RegEx::RegEx().