Tantivy – A Full-text, Horse-speed Search Engine Library in Rust

Tantivy – A Full-text, Horse-speed Search Engine Library in Rust

github.com github.com3 months ago in #Silicon Valley Love48

Tantivy is a complete-textual content seek engine library written in Rust. It is nearer to Apache Lucene than to Elasticsearch or Apache Solr in the sense it isn’t an off-the-shelf seek engine server, however reasonably a crate that can be utilized to construct one of these seek engine. Tantivy is, in reality, strongly impressed via Lucene’s design. If you’re in search of an alternative choice to Elasticsearch or Apache Solr, take a look at Quickwit, our seek engine constructed on most sensible of Tantivy. The following benchmark breakdowns efficiency for various kinds of queries/collections. Your mileage WILL range relying at the nature of queries and their load. Full-text seek Configurable tokenizer (stemming to be had for 17 Latin languages with 3rd birthday celebration improve for Chinese (tantivy-jieba and cang-jie), Japanese (lindera, Vaporetto, and tantivy-tokenizer-tiny-segmenter) and Korean (lindera lindera-ko-dic-builder) Fast (take a look at the 🐎 ✨ benchmark ✨ 🐎) Tiny startup time (= 1.27) and helps Linux, macOS, and Windows. Tantivy’s easy seek instance tantivy-cli and its educational – tantivy-cli is a real command-line interface that makes it simple so that you can create a seek engine, index paperwork, and seek by means of the CLI or a small server with a REST API. It walks you thru getting a Wikipedia seek engine up and working in a couple of mins. Reference document for the ultimate launched model There are some ways to improve this challenge. Use Tantivy and let us know about your enjoy on Discord or via e-mail (paul.masurel@gmail.com) Report insects Write a weblog publish Help with documentation via asking questions or filing PRs Contribute code (you’ll sign up for our Discord server) Talk about Tantivy round you We use the GitHub Pull Request workflow: reference a GitHub price tag and/or come with a complete dedicate message when opening a PR. Clone and construct in the neighborhood Tantivy compiles on strong Rust however calls for Rust = 1.27. To take a look at and run assessments, you’ll merely run: git clone https://github.com/quickwit-oss/tantivy.git cd tantivy shipment construct Run assessments Some assessments won’t run with simply shipment verify as a result of fail-rs. To run the assessments exhaustively, run ./run-assessments.sh. Debug You would possibly in finding it helpful to step in the course of the programme with a debugger. A failing verify Make certain you haven’t run shipment blank after the latest shipment verify or shipment construct to be sure that the objective/ listing exists. Use this bash script to search out the call of the latest debug construct of Tantivy and run it beneath rust-gdb: in finding goal/debug/ -maxdepth 1 -executable -type f -name “tantivy*” -printf ‘%TY-%Tm-%Td %TT %pn’ | type -r | lower -d ” ” -f 3 | xargs -I RECENT_DBG_TANTIVY rust-gdb RECENT_DBG_TANTIVY Now that you’re in rust-gdb, you’ll set breakpoints on strains and strategies that fit your supply code and run the debug executable with flags that you just most often go to shipment verify like this: $gdb run –test-threads 1 –test $NAME_OF_TEST An instance By default,  » Read More

Like to keep reading?

This article first appeared on github.com. If you'd like to keep reading, follow the white rabbit.

View Full Article

Leave a Reply