Last active
December 21, 2018 09:07
-
-
Save e-maud/f2b605665cf92646b76384b280973ac5 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
## Memo - impresso SOLR queries | |
### OCR quality assessment | |
- range facet over ocrqa | |
``` | |
https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select? | |
facet.field=ocrqa_f& | |
facet.query=ocrqa_f:[* TO 0.2]& | |
facet.query=ocrqa_f:[0.2 TO 0.4]& | |
facet.query=ocrqa_f:[0.4 TO 0.6]& | |
facet.query=ocrqa_f:[0.6 TO 0.8]& | |
facet.query=ocrqa_f:[0.8 TO 0*]& | |
facet=on& | |
fl=id ocrqa_f& | |
q=*:* | |
``` | |
[query link](https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?facet.field=ocrqa_f&facet.query=ocrqa_f:[*%20TO%200.2]&facet.query=ocrqa_f:[0.2%20TO%200.4]&facet.query=ocrqa_f:[0.4%20TO%200.6]&facet.query=ocrqa_f:[0.6%20TO%200.8]&facet.query=ocrqa_f:[0.8%20TO%20*]&facet=on&fl=id%20ocrqa_f&q=*:*) | |
- getting content items with bound ocrqa | |
``` | |
https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select? | |
fl=id ocrqa_f& | |
fq={!frange l=0.8 u=1.0}ocrqa_f& | |
q=*:* | |
``` | |
[query link](https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?fl=id%20ocrqa_f&fq={!frange%20l=0.8%20u=1.0}ocrqa_f&q=*:*) | |
### Topics | |
- Given a topic, find all articles having this topic and sort them according to topic prob: | |
`https://dhlabsrv18.epfl.ch/solr/impresso_dev/select?q=topics_dpfs:tmLETEMPS_tp04_fr&sort=payload(topics_dpf,tmLETEMPS_tp04_fr)%20desc&rows=100&start=0` | |
- Thresholding | |
`https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?q=topics_dpf:tmGDL_tp04_fr&fq={!frange%20l=0.993}payload(topics_dpf,tmGDL_tp04_fr)&fl=topics_dpfs%20meta_year_i%20id&sort=payload(topics_dpfs,tmGDL_tp04_fr)%20desc&rows=100&start=0` | |
- With variables | |
`https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?q=topics_dpfs:tmGDL_tp04_fr&topic_prob=payload(topics_dpfs,tmGDL_tp04_fr)&fq={!frange%20l=0.993}$topic_prob&fl=topics_dpfs%20meta_year_i%20id&sort=$topic_prob%20desc&rows=100&start=0` | |
- Select documents with topic x and y, above a certain treshold: | |
``` | |
https://dhlabsrv18.epfl.ch/solr/impresso_dev/select? facet.field=meta_year_i& <= faceting the whole result on year | |
facet=on& | |
fl=topics_dpfs meta_year_i id p-tmLETEMPS_tp05_fr:payload(topics_dpfs,tmLETEMPS_tp05_fr) p-tmLETEMPS_tp16_fr:payload(topics_dpfs,tmLETEMPS_tp16_fr)& <= field to display, with custom field 'p' to get the payload | |
fq={!frange l=0.01}payload(topics_dpfs,tmLETEMPS_tp16_fr)& | |
fq={!frange l=0.5}payload(topics_dpfs,tmLETEMPS_tp05_fr)& <= filter queries using function range parser ('fr'), setting the lower bound 'l' on 'payload' | |
q=topics_dpfs:tmLETEMPS_tp16_fr AND topics_dpfs:tmLETEMPS_tp05_fr& | |
sort=payload(topics_dpfs,tmLETEMPS_tp05_fr) desc | |
``` | |
[query link](https://dhlabsrv18.epfl.ch/solr/impresso_dev/select?facet.field=meta_year_i&facet=on&fl=topics_dpfs%20meta_year_i%20id%20p-tmLETEMPS_tp05_fr:payload(topics_dpfs,tmLETEMPS_tp05_fr)%20p-tmLETEMPS_tp16_fr:payload(topics_dpfs,tmLETEMPS_tp16_fr)&fq={!frange%20l=0.01}payload(topics_dpfs,tmLETEMPS_tp16_fr)&fq={!frange%20l=0.5}payload(topics_dpfs,tmLETEMPS_tp05_fr)&q=topics_dpfs:tmLETEMPS_tp16_fr%20AND%20topics_dpfs:tmLETEMPS_tp05_fr&sort=payload(topics_dpfs,tmGDL_tp05_fr)%20desc) | |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment