Skip to content

Instantly share code, notes, and snippets.

@e-maud
Last active December 21, 2018 09:07
Show Gist options
  • Save e-maud/f2b605665cf92646b76384b280973ac5 to your computer and use it in GitHub Desktop.
Save e-maud/f2b605665cf92646b76384b280973ac5 to your computer and use it in GitHub Desktop.
## Memo - impresso SOLR queries
### OCR quality assessment
- range facet over ocrqa
```
https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?
facet.field=ocrqa_f&
facet.query=ocrqa_f:[* TO 0.2]&
facet.query=ocrqa_f:[0.2 TO 0.4]&
facet.query=ocrqa_f:[0.4 TO 0.6]&
facet.query=ocrqa_f:[0.6 TO 0.8]&
facet.query=ocrqa_f:[0.8 TO 0*]&
facet=on&
fl=id ocrqa_f&
q=*:*
```
[query link](https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?facet.field=ocrqa_f&facet.query=ocrqa_f:[*%20TO%200.2]&facet.query=ocrqa_f:[0.2%20TO%200.4]&facet.query=ocrqa_f:[0.4%20TO%200.6]&facet.query=ocrqa_f:[0.6%20TO%200.8]&facet.query=ocrqa_f:[0.8%20TO%20*]&facet=on&fl=id%20ocrqa_f&q=*:*)
- getting content items with bound ocrqa
```
https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?
fl=id ocrqa_f&
fq={!frange l=0.8 u=1.0}ocrqa_f&
q=*:*
```
[query link](https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?fl=id%20ocrqa_f&fq={!frange%20l=0.8%20u=1.0}ocrqa_f&q=*:*)
### Topics
- Given a topic, find all articles having this topic and sort them according to topic prob:
`https://dhlabsrv18.epfl.ch/solr/impresso_dev/select?q=topics_dpfs:tmLETEMPS_tp04_fr&sort=payload(topics_dpf,tmLETEMPS_tp04_fr)%20desc&rows=100&start=0`
- Thresholding
`https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?q=topics_dpf:tmGDL_tp04_fr&fq={!frange%20l=0.993}payload(topics_dpf,tmGDL_tp04_fr)&fl=topics_dpfs%20meta_year_i%20id&sort=payload(topics_dpfs,tmGDL_tp04_fr)%20desc&rows=100&start=0`
- With variables
`https://dhlabsrv18.epfl.ch/solr/impresso_sandbox/select?q=topics_dpfs:tmGDL_tp04_fr&topic_prob=payload(topics_dpfs,tmGDL_tp04_fr)&fq={!frange%20l=0.993}$topic_prob&fl=topics_dpfs%20meta_year_i%20id&sort=$topic_prob%20desc&rows=100&start=0`
- Select documents with topic x and y, above a certain treshold:
```
https://dhlabsrv18.epfl.ch/solr/impresso_dev/select? facet.field=meta_year_i& <= faceting the whole result on year
facet=on&
fl=topics_dpfs meta_year_i id p-tmLETEMPS_tp05_fr:payload(topics_dpfs,tmLETEMPS_tp05_fr) p-tmLETEMPS_tp16_fr:payload(topics_dpfs,tmLETEMPS_tp16_fr)& <= field to display, with custom field 'p' to get the payload
fq={!frange l=0.01}payload(topics_dpfs,tmLETEMPS_tp16_fr)&
fq={!frange l=0.5}payload(topics_dpfs,tmLETEMPS_tp05_fr)& <= filter queries using function range parser ('fr'), setting the lower bound 'l' on 'payload'
q=topics_dpfs:tmLETEMPS_tp16_fr AND topics_dpfs:tmLETEMPS_tp05_fr&
sort=payload(topics_dpfs,tmLETEMPS_tp05_fr) desc
```
[query link](https://dhlabsrv18.epfl.ch/solr/impresso_dev/select?facet.field=meta_year_i&facet=on&fl=topics_dpfs%20meta_year_i%20id%20p-tmLETEMPS_tp05_fr:payload(topics_dpfs,tmLETEMPS_tp05_fr)%20p-tmLETEMPS_tp16_fr:payload(topics_dpfs,tmLETEMPS_tp16_fr)&fq={!frange%20l=0.01}payload(topics_dpfs,tmLETEMPS_tp16_fr)&fq={!frange%20l=0.5}payload(topics_dpfs,tmLETEMPS_tp05_fr)&q=topics_dpfs:tmLETEMPS_tp16_fr%20AND%20topics_dpfs:tmLETEMPS_tp05_fr&sort=payload(topics_dpfs,tmGDL_tp05_fr)%20desc)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment