searxng/searx/engines/unsplash.py

# SPDX-License-Identifier: AGPL-3.0-or-later
# lint: pylint
# pylint: disable=missing-function-docstring
"""Unsplash

"""

from urllib.parse import urlencode, urlparse, urlunparse, parse_qsl
from json import loads

from searx import logger

logger = logger.getChild('unsplash engine')
# about
about = {
    "website": 'https://unsplash.com',
    "wikidata_id": 'Q28233552',
    "official_api_documentation": 'https://unsplash.com/developers',
    "use_official_api": False,
    "require_api_key": False,
    "results": 'JSON',
}

base_url = 'https://unsplash.com/'
search_url = base_url + 'napi/search/photos?'
categories = ['images']
page_size = 20
paging = True


def clean_url(url):
    parsed = urlparse(url)
    query = [(k, v) for (k, v)
             in parse_qsl(parsed.query) if k not in ['ixid', 's']]

    return urlunparse((
        parsed.scheme,
        parsed.netloc,
        parsed.path,
        parsed.params,
        urlencode(query),
        parsed.fragment
    ))


def request(query, params):
    params['url'] = search_url + urlencode({
        'query': query, 'page': params['pageno'], 'per_page': page_size
    })
    logger.debug("query_url --> %s", params['url'])
    return params


def response(resp):
    results = []
    json_data = loads(resp.text)

    if 'results' in json_data:
        for result in json_data['results']:
            results.append({
                'template': 'images.html',
                'url': clean_url(result['links']['html']),
                'thumbnail_src': clean_url(result['urls']['thumb']),
                'img_src': clean_url(result['urls']['raw']),
                'title': result.get('alt_description') or 'unknown',
                'content': result.get('description') or ''
            })

    return results
[enh] engines: add about variable move meta information from comment to the about variable so the preferences, the documentation can show these information 2021-01-13 10:31:25 +00:00			`# SPDX-License-Identifier: AGPL-3.0-or-later`
[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`# lint: pylint`
			`# pylint: disable=missing-function-docstring`
			`"""Unsplash`

Adds the Unsplash image engine 2018-10-02 13:08:43 +00:00			`"""`

Drop Python 2 (1/n): remove unicode string and url_utils 2020-08-06 15:42:46 +00:00			`from urllib.parse import urlencode, urlparse, urlunparse, parse_qsl`
Adds the Unsplash image engine 2018-10-02 13:08:43 +00:00			`from json import loads`

[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`from searx import logger`

			`logger = logger.getChild('unsplash engine')`
[enh] engines: add about variable move meta information from comment to the about variable so the preferences, the documentation can show these information 2021-01-13 10:31:25 +00:00			`# about`
			`about = {`
			`"website": 'https://unsplash.com',`
			`"wikidata_id": 'Q28233552',`
			`"official_api_documentation": 'https://unsplash.com/developers',`
			`"use_official_api": False,`
			`"require_api_key": False,`
			`"results": 'JSON',`
			`}`

[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`base_url = 'https://unsplash.com/'`
			`search_url = base_url + 'napi/search/photos?'`
Adds the Unsplash image engine 2018-10-02 13:08:43 +00:00			`categories = ['images']`
			`page_size = 20`
			`paging = True`


Removes what looks like tracking parameters 2018-10-08 12:56:20 +00:00			`def clean_url(url):`
			`parsed = urlparse(url)`
[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`query = [(k, v) for (k, v)`
			`in parse_qsl(parsed.query) if k not in ['ixid', 's']]`
Removes what looks like tracking parameters 2018-10-08 12:56:20 +00:00
[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`return urlunparse((`
			`parsed.scheme,`
			`parsed.netloc,`
			`parsed.path,`
			`parsed.params,`
			`urlencode(query),`
			`parsed.fragment`
			`))`
Removes what looks like tracking parameters 2018-10-08 12:56:20 +00:00

Adds the Unsplash image engine 2018-10-02 13:08:43 +00:00			`def request(query, params):`
[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`params['url'] = search_url + urlencode({`
			`'query': query, 'page': params['pageno'], 'per_page': page_size`
			`})`
			`logger.debug("query_url --> %s", params['url'])`
Adds the Unsplash image engine 2018-10-02 13:08:43 +00:00			`return params`


			`def response(resp):`
			`results = []`
			`json_data = loads(resp.text)`

Uses the raw url for the image result, rather than the full size result. 2018-10-08 12:01:35 +00:00			`if 'results' in json_data:`
			`for result in json_data['results']:`
[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`results.append({`
			`'template': 'images.html',`
			`'url': clean_url(result['links']['html']),`
			`'thumbnail_src': clean_url(result['urls']['thumb']),`
			`'img_src': clean_url(result['urls']['raw']),`
[fix] unsplash engine - 'searx:result: invalid title:' - Use result 'alt_description' as title, if not given use default title 'unknown'. - Use result 'description' from unsplash as 'content' Fix error:: DEBUG:searx:result: invalid title: {..., 'title': None, 'content': '', 'engine': 'unsplash'} Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 15:26:58 +00:00			`'title': result.get('alt_description') or 'unknown',`
			`'content': result.get('description') or ''`
[pylint] searx/engines/unsplash.py, add logger & norm indentation - fix messages from pylint - add logger and log request URL - normalized various indentation Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 14:45:32 +00:00			`})`
[fix] unsplash engine - 'searx:result: invalid title:' - Use result 'alt_description' as title, if not given use default title 'unknown'. - Use result 'description' from unsplash as 'content' Fix error:: DEBUG:searx:result: invalid title: {..., 'title': None, 'content': '', 'engine': 'unsplash'} Signed-off-by: Markus Heiser <markus.heiser@darmarit.de> 2021-05-25 15:26:58 +00:00
Adds the Unsplash image engine 2018-10-02 13:08:43 +00:00			`return results`