pleroma

mirror of https://git.pleroma.social/pleroma/pleroma.git synced 2024-06-27 09:30:37 +00:00

Author	SHA1	Message	Date
Mark Felder	b135fa35a1	RichMedia: test that activity is streamed out	2024-06-24 09:47:16 -04:00
Mark Felder	634e3d4155	Add test validating the activity_id is correctly present in the Oban job This was preventing the activity from being streamed over websockets.	2024-06-23 21:45:56 -04:00
Mark Felder	17d04ccc8b	RichMedia backfill processing through Oban	2024-06-19 23:20:22 -04:00
Mark Felder	4dfa50f256	Rename RichMediaExpirationWorker to RichMediaWorker	2024-06-19 22:24:48 -04:00
feld	f44987bd0f	Merge branch 'bugfix/rich_media_config' into 'develop' RichMedia: Respect configuration on status previews See merge request pleroma/pleroma!4130	2024-06-07 20:37:19 +00:00
Haelwenn (lanodan) Monnier	65c8763907	RichMedia: Add extra checks on configuration	2024-05-29 08:02:06 +02:00
Haelwenn (lanodan) Monnier	c16ef40f13	RichMedia: Respect configuration on status previews	2024-05-29 08:02:04 +02:00
Mark Felder	a50c657427	Add a dedicated connection pool for Rich Media Sharing this pool with regular Media is problematic as Rich Media will connect to many different domains and thrash the pool, but regular Media will have predictable connections to the webservers hosting media for the fediverse servers you peer with.	2024-05-27 11:17:02 -04:00
Mark Felder	807782b7f9	Fix rich media parsing some Amazon URLs	2024-05-26 14:02:20 -04:00
Mark Felder	54c2bab25f	Fix module struct matching	2024-05-07 22:27:18 -04:00
Mark Felder	9a83301ff8	Credo	2024-05-07 22:11:19 -04:00
Mark Felder	37c35daba6	Credo	2024-05-07 22:10:49 -04:00
Mark Felder	9b9a32bf74	Fix compile warning warning: "else" clauses will never match because all patterns in "with" will always match lib/pleroma/web/rich_media/parser/ttl/opengraph.ex:10	2024-05-07 21:56:27 -04:00
Mark Felder	5a5a193877	Fix broken Rich Media parsing when the image URL is a relative path	2024-05-07 19:54:56 -04:00
Mark Felder	d21aa1a77c	Respect the TTL returned in OpenGraph tags	2024-05-07 19:54:56 -04:00
Mark Felder	df0734fcbf	Increase the :max_body for Rich Media to 5MB Websites are increasingly getting more bloated with tricks like inlining content (e.g., CNN.com) which puts pages at or above 5MB. This value may still be too low.	2024-05-07 19:54:56 -04:00
Mark Felder	ede414094f	RichMedia refactor Rich Media parsing was previously handled on-demand with a 2 second HTTP request timeout and retained only in Cachex. Every time a Pleroma instance is restarted it will have to request and parse the data for each status with a URL detected. When fetching a batch of statuses they were processed in parallel to attempt to keep the maximum latency at 2 seconds, but often resulted in a timeline appearing to hang during loading due to a URL that could not be successfully reached. URLs which had image links that expire (Amazon AWS) were parsed and inserted with a TTL to ensure the image link would not break. Rich Media data is now cached in the database and fetched asynchronously. Cachex is used as a read-through cache. When the data becomes available we stream an update to the clients. If the result is returned quickly the experience is almost seamless. Activities were already processed for their Rich Media data during ingestion to warm the cache, so users should not normally encounter the asynchronous loading of the Rich Media data.
Implementation notes: - The async worker is a Task with a globally unique process name to prevent duplicate processing of the same URL - The Task will attempt to fetch the data 3 times with increasing sleep time between attempts - The HTTP request obeys the default HTTP request timeout value instead of 2 seconds - URLs that cannot be successfully parsed due to an unexpected error receive a negative cache entry for 15 minutes - URLs that fail with an expected error will receive a negative cache with no TTL - Activities that have no detected URLs insert a nil value in the Cachex :scrubber_cache so we do not repeat parsing the object content with Floki every time the activity is rendered - Expiring image URLs are handled with an Oban job - There is no automatic cleanup of the Rich Media data in the database, but it is safe to delete at any time - The post draft/preview feature makes the URL processing synchronous so the rendered post preview will have an accurate rendering Overall performance of timelines and creating new posts which contain URLs is greatly improved.	2024-05-07 19:54:56 -04:00
Mark Felder	9f2319e50d	RichMedia.Helpers: move the validate_page_url/1 function to the Parser module This will ensure that the page validation happens in Parser.parse/1 so it can be called from anywhere and still filter invalid URLs.	2024-02-06 18:34:02 -05:00
Mark Felder	6b7b443ff9	Pleroma.Web.RichMedia.Parser: Remove test-specific codepaths Also consolidate Tesla mocks into the HttpRequestMock module. Tests were not exercising the real codepaths. The Rich Media Preview only works with https, but most of these tests were only mocking http.	2024-02-06 18:33:54 -05:00
Mark Felder	0cc038b67c	Ensure URLs with IP addresses for the host do not generate previews	2024-02-05 00:09:37 -05:00
Mark Felder	579561e97b	URI.authority is deprecated	2024-02-04 23:49:07 -05:00
Mark Felder	04fc4eddaa	Fix Rich Media Previews for updated activities The Rich Media Previews were not regenerated when a post was updated due to a cache invalidation issue. They are now cached by the activity id so they can be evicted with the other activity cache objects in the :scrubber_cache.	2024-02-04 23:47:04 -05:00
Haelwenn	251c455b91	Merge branch 'deps-bump' into 'develop' Bump dependencies See merge request pleroma/pleroma!4044	2024-01-29 17:43:00 +00:00
Mark Felder	06b8923d42	RichMedia.Parser.TTL.AwsSignedUrl: dialyzer fix lib/pleroma/web/rich_media/parser/ttl/aws_signed_url.ex:9:callback_type_mismatch Type mismatch for @callback ttl/2 in Pleroma.Web.RichMedia.Parser.TTL behaviour. Expected type: nil | integer() Actual type: {:error, <<_::64, _::size(8)>>} | {:ok, integer()}	2024-01-26 17:37:32 -05:00
Mark Felder	5b95abaeea	Credo.Check.Readability.PredicateFunctionNames This check was recently improved in Credo and it does make sense for readability. The offending functions in Pleroma have been renamed and a couple missing the ? suffix have been fixed as well.	2024-01-26 16:59:58 -05:00
Mark Felder	09ae0ab24a	Fix invalid type lib/pleroma/web/rich_media/parser.ex:105:unknown_type Unknown type: Integer.t/0.	2024-01-20 17:16:10 -05:00
Mark Felder	467a65af90	Fix invalid types lib/pleroma/web/rich_media/parser/ttl.ex:6:unknown_type Unknown type: Integer.t/0. lib/pleroma/web/rich_media/parser/ttl.ex:6:unknown_type Unknown type: Map.t/0.	2024-01-20 17:14:56 -05:00
Mark Felder	9896b64f54	Elixir 1.15: Chase the Logger.warn deprecation	2023-12-20 20:16:26 +00:00
Lain Soykaf	00def0875b	RichMediaTest: Use mocked config	2023-12-12 13:28:11 +04:00
Mark Felder	0d68804aa7	Filter OEmbed HTML tags	2023-05-26 19:54:24 +02:00
lain	e853cfe7c3	Revert "Merge branch 'copyright-bump' into 'develop'" This reverts merge request !3825	2023-01-02 20:38:50 +00:00
marcin mikołajczak	10886eeaa2	Bump copyright year Signed-off-by: marcin mikołajczak <git@mkljczk.pl>	2023-01-01 12:13:06 +01:00
Sean King	17aa3644be	Copyright bump for 2022	2022-02-25 23:11:42 -07:00
Haelwenn (lanodan) Monnier	c4439c630f	Bump Copyright to 2021 grep -rl '# Copyright © .* Pleroma' * | xargs sed -i 's;Copyright © .* Pleroma .*;Copyright © 2017-2021 Pleroma Authors <https://pleroma.social/>;'	2021-01-13 07:49:50 +01:00
lain	e1e7e4d379	Object: Rework how Object.normalize works Now it defaults to not fetching, and the option is named.	2021-01-04 13:38:31 +01:00
lain	713612c377	Cachex: Make caching provider switchable at runtime. Defaults to Cachex.	2020-12-18 17:44:46 +01:00
Alexander Strizhakov	8d218ebaf5	Moving some background jobs into simple tasks - fetching activity data - attachment prefetching - using limiter to prevent overload	2020-11-11 13:39:49 +03:00
Alexander Strizhakov	fc7151a9c4	more files renamings	2020-10-13 16:38:19 +03:00
Alexander Strizhakov	103f3dcb9e	rich media parser ttl files consistency	2020-10-13 16:38:15 +03:00
Mark Felder	8539e386c3	Add missing Copyright headers	2020-10-12 12:00:50 -05:00
Mark Felder	ba7f9459b4	Revert Rich Media censorship for sensitive statuses The #NSFW hashtag test was broken anyway.	2020-09-28 18:22:59 -05:00
rinpatch	db80b9d630	RichMedia: Fix log spam on failures and resetting TTL on cached errors	2020-09-17 16:56:39 +03:00
rinpatch	bb407edce4	RichMedia: fix a compilation error due to nonexistent variable No idea why this passed Gitlab CI	2020-09-14 15:46:00 +03:00
rinpatch	f70335002d	RichMedia: Do a HEAD request to check content type/length This shouldn't be too expensive, since the connections are pooled, but it should save us some bandwidth since we won't fetch non-html files and files that are too large for us to process (especially since you can't cancel a request without closing the connection with HTTP1).	2020-09-14 14:45:58 +03:00
rinpatch	f66a15c4a5	RichMedia parser: do not set a cache TTL for unchanging errors	2020-09-14 14:44:25 +03:00
Alexander Strizhakov	696bf09433	passing adapter options directly without adapter key	2020-09-07 19:59:17 +03:00
Alexander Strizhakov	a83916fdac	adapter options unification not needed options deletion	2020-09-07 19:59:17 +03:00
lain	fdab01ab56	Merge branch 'fix/rich-media-fake-statuses' into 'develop' Rich Media: Do not cache URLs for preview statuses Closes #1987 See merge request pleroma/pleroma!2956	2020-09-07 10:19:19 +00:00
rinpatch	170599c390	RichMedia: do not log webpages missing metadata as errors Also fixes the return value of Parser.parse on errors, previously was just `:ok` due to the logger call in the end	2020-09-05 22:05:35 +03:00
rinpatch	e198ba492e	Rich Media: Do not cache URLs for preview statuses Closes #1987	2020-09-05 20:53:46 +03:00


139 commits