Commons:Village pump/Technical

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Shortcuts: COM:VP/T • COM:VPT

Welcome to the Village pump technical section
Technical discussion
Village pump/Technical
 Bug reports
 Code review
Tools
 Tools/Directory
 Idea Lab



This page is used for technical questions relating to the tools, gadgets, or other technical issues about Commons; it is distinguished from the main Village pump, which handles community-wide discussion of all kinds. The page may also be used to advertise significant discussions taking place elsewhere, such as on the talk page of a Commons policy. Recent sections with no replies for 30 days and sections tagged with {{Section resolved|1=--~~~~}} may be archived; for old discussions, see the archives; recent archives: /Archive/2024/08 /Archive/2024/09.

Please note
 
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 1 day and sections whose most recent comment is older than 30 days.

The XML in the uploaded file could not be parsed

[edit]

Hello! I wanted to created some map. I got free baselayer in PNG, opened Inkscape and made import of PNG file in software. After that I've added several lines and symbols and saved the result in SVG. If I try to upload the result to Commons, I see "The XML in the uploaded file could not be parsed". One hypothesis is that problem is in embedded PNG-layer, but, as I remember, there are SVG-files in Commons, which contain raster layers. Size of file is 12 Mb. Microsoft Edge opens file normally. What does cause the uploading error? It is possible to download the file for its checking. Perhaps, there is some web service, which cand repair structure of document, if it is broken? But, indeed, I'm not sure, that there file is broken: it is simple (raster layer, a few lines and symbols) and is not huge. Dinamik (talk) 09:44, 27 July 2024 (UTC)[reply]

We do not allow uploads of svgs with images inside of them. Its is often misused and it creates potential security problems because our filescanners do not work on those embedded images. —TheDJ (talkcontribs) 08:07, 28 July 2024 (UTC)[reply]
Did such limitation exist in Commons always? I believe, that, for example, first versions of this file have embedded baselayer. Dinamik (talk) 09:56, 28 July 2024 (UTC)[reply]
Probably not, see Category:Fake SVG. Enhancing999 (talk) 10:16, 28 July 2024 (UTC)[reply]
Commons has always allowed files to have embedded bitmaps, but those bitmaps must use the data: scheme. Files with external URLs are now blocked from uploading. Furthermore, the Commons rasterizer will not fetch external URLs, so such a base layer would no longer display. All the versions of the St. Petersburg map display, so there would not be an external URL. Glrx (talk) 22:57, 28 July 2024 (UTC)[reply]
The file is over 10 MB. At one point, SVG uploads were limited to 10 MB, but I do not believe the is still the case.
The file is mostly an embedded PNG. Following that, there are some path and flowRoot elements. The path elements should be OK, but the flowRoot is not supported. It was described in an SVG 1.2 draft, but that draft was not accepted. The element does not exist in the SVG 2.0 spec.
WMF supports SVG 1.1. Even if you could upload the file, it would not display as you would expect.
I do not see a reason for the XML error. W3's validator finds 67 errors, but they only involve normal Inkscape, sodipodi, and RDF extensions or the bogus flowRoot elements.
Glrx (talk) 23:15, 28 July 2024 (UTC)[reply]
Running rsvg-convert (latest version, 2.58) on that SVG gives an error without the --unlimited option, which is described as "The XML parser has some guards designed to mitigate large CPU or memory consumption in the face of malicious documents. It may also refuse to resolve data: URIs used to embed image data in SVG documents." Dexxor (talk) 07:17, 29 July 2024 (UTC)[reply]
Yes, i think the most likely answer is that mediawiki is not setting LIBXML_PARSEHUGE, which limits the max size of a text nodes and attributes to 10 (Decimal) megabytes. As embedded images are stored as base64 data: urls, this would limit the max size to 10mb after base64 encoding (in practise about 6.98 MiB raw size). As far as I know we fully allow embedded images in SVG if they are under that limit, however they are usually not a good idea. If it was important to commons that these types of files be uploaded, we might be able to add the flag, but I'd prefer to keep the flag off if it isn't really needed. Bawolff (talk) 21:36, 18 August 2024 (UTC)[reply]

Uploadig a large svg file

[edit]

I am tring to upload this svg file. and it does not working. maybe becose it is too large. are there size limitation on svg files? what are they? Can somebody help me to upload it?

Thanks a lot, Aizenr (talk) 10:05, 7 August 2024 (UTC)[reply]

my file is 38MB. this is much smaller then the Maximum file size. So this is probably not the main problem... Any other sugestions? Aizenr (talk) 17:17, 7 August 2024 (UTC)[reply]
@Aizenr: Your SVG contains a large PNG, resulting in the same issue as in #The_XML_in_the_uploaded_file_could_not_be_parsed. --Dexxor (talk) 20:52, 7 August 2024 (UTC)[reply]
thank you very much. I read now the discusion above, but I stell do not undestand what is the solution. Can I chinge something in my file that will solve the problem? If there is now other chice, I can downsize the png image.How much should I downsize it?
Thanks again, Aizenr (talk) 04:47, 8 August 2024 (UTC)[reply]
@Aizenr: I would simply convert the SVG into PNG using rsvg-convert -u or Inkscape, and then upload the PNG (it should also be much smaller, around 3MB). Dexxor (talk) 05:52, 8 August 2024 (UTC)[reply]
thank you for the sujestion. The main aim of this file was to criate an svg version of an existing png file so text in different lengwiges could be added. So I prefer to keep it as svg. If I split the png into 2 pices, do I have the same limitation for each of them or on the total? Thanks again, Rami (Aizenr, talk) 12:33, 8 August 2024 (UTC)[reply]
@Aizenr: The PNG is already split into two pieces. I think your only option is to make the PNGs smaller, like this. ---Dexxor (talk) 20:58, 9 August 2024 (UTC)[reply]
Isn't that problem not one of size, but that Category:Fake SVG are no longer accepted? Enhancing999 (talk) 21:03, 9 August 2024 (UTC)[reply]
You would probably have to split it into more than two pieces. Try and aim so that no piece is larger than 6 megabytes. Bawolff (talk) 21:41, 18 August 2024 (UTC)[reply]

Flickr2Commons tool not working for about 24 hours.

[edit]

I get this message currently-

  • "Wikimedia Toolforge Error"
  • "Our servers are currently experiencing a technical problem. This is probably temporary and should be fixed soon. Please try again later."
  • "tools-proxy-8.tools.eqiad1.wikimedia.cloud"

It has been about 24 hours. Other Users are apparently using this same tool without any problems. Thanks, -- Ooligan (talk) 15:03, 8 August 2024 (UTC)[reply]

Also see
M2k~dewiki (talk) 23:42, 9 August 2024 (UTC)[reply]
Still not working. Is this permanently broken, and I should choose a different way to do these tasks, or will it be coming back? - Jmabel ! talk 19:39, 12 August 2024 (UTC)[reply]
https://phabricator.wikimedia.org/T372451 M2k~dewiki (talk) 15:56, 14 August 2024 (UTC);[reply]
Update: User:DaxServer create a clone of F2C and it was available here, though the initial F2C was still downed. --A1Cafel (talk) 03:21, 18 August 2024 (UTC)[reply]

Special:UncategorizedCategories says it runs once a month. It is now five days beyond that. Even once a month is a long time for this report, which used to run every three days. No doubt partly as a result of that longer interval, we have gone from having about 100-200 such categories early this year to over 1500 in the latest (July 7) report. I've dealt with several hundred of those, and would really like to have a more updated report to work from. As far as I can tell, there is no reasonable way to find uncategorized categories other than this report, and except for the rare false positive, every single uncategorized category genuinely represents a task to be done. - Jmabel ! talk 19:31, 12 August 2024 (UTC)[reply]

Why was it changed to much less frequent runs? I think it would be better if like with Suggested Edits, categories were suggested for these (and also categories that already have categories) – for example using the categories of the Wikipedia items if the category is linked to (these could have cats with matching cats on WMC or cats that have parent cats with matching cats on WMC). Prototyperspective (talk) 23:20, 19 August 2024 (UTC)[reply]
It's updated. Count of 11:11, 22 August 2024 is 1534. Enhancing999 (talk) 12:25, 22 August 2024 (UTC)[reply]
phab:T369024: Seems the outcome is Special:UncategorizedPages got deactivated instead.
@Ladsgroup can't we keep both?
A way to speed it up could be to limit Special:UncategorizedPages to namespace 0. Pages don't need to appear there and on Special:UncategorizedFiles.
FYI: @Bawolff, @Mdaniels5757, @JopkeB. Enhancing999 (talk) 14:00, 22 August 2024 (UTC)[reply]
MediaWiki currently does not support setting Special:UncategorizedPages to use namespaces different than the Content Namespaces Bawolff (talk) 09:27, 23 August 2024 (UTC)[reply]
The query for Uncategorized pages is really strange. I wonder what usecase it's meant for. It's much quicker when it's limited to namespace 0. Adding namespace 6 and filtering that for pages without files is just strange. It seems to be me that the bug needing fixing is to write correct SQL. Enhancing999 (talk) 09:42, 23 August 2024 (UTC)[reply]
Thanks for pinging me. My questions:
  1. What exactly is the problem? What is the frequence of runs yet? @Jmabel: What would be a good frequence for you?
  2. What is the purpose of phab:T369024 exactly? Please explain in plain English. I have not enough knowledge of this tool to understand what the consequences of this task might be. Anyway: Special:UncategorizedPages should still run now and then, it should not be deactivated. If the frequence now is a problem, I think it might run weekly or perhaps even less frequent, but we cannot do without it.
  3. What does "limit Special:UncategorizedPages to namespace 0" mean? And in the next sentence: what does "there" mean? Now it looks like referring twice to the same list.
JopkeB (talk) 09:34, 23 August 2024 (UTC)[reply]
As you noted elsewhere Special:UncategorizedPages included both galleries (namespace 0) and file description pages that are empty (namespace 6). These empty file description pages appear also on Special:UncategorizedFiles.
The query for namespace 6 in Special:UncategorizedPages is really resource intensive at Commons (this lead to the bug above). I can't really think of a cases where it's needed though.
As the bug report isn't very clear, it lead to the wrong "fixes" and Special:UncategorizedPages is currently deactivated. Fixing it correctly should make updating Special:UncategorizedPages much faster. Enhancing999 (talk) 09:51, 23 August 2024 (UTC)[reply]
@Enhancing999: Thanks for your explanation. How would Special:UncategorizedPages get correctly updated, what should be done? Is phab:T369024 making that happen or is it the other way around: is that the ticket implementing the deactivating and should we ask for a new ticket/task? JopkeB (talk) 11:18, 23 August 2024 (UTC)[reply]
BTW, currently there are few pages on Special:UncategorizedPages and Special:UncategorizedFiles, but this is due to several people cleaning them up fairly thoroughly since JopeBe's report above. This does not mean the reports aren't needed or should be run less frequently. Enhancing999 (talk) 09:59, 23 August 2024 (UTC)[reply]
Hi, Page table of Wikimedia Commons currently has more than 110M rows, its categorylinks table has more than 800M rows. Joining these two tables with such conditions is too expensive in our production and can cause issues and bring down our databases. That's why we had to reduce its frequency to once a month. We will eventually migrate these reports to hadoop and bring back its previous frequency but that's far in the future (phab:T309738). In the meantime, you can improve the condition by excluding File namespace in the query (which would make it faster), and then just run it against wikireplicas. Similar to how English Wikipedia builds reports for their needs (en:WP:DBR). Sorry for the inconvenience but we don't really have a choice. ASarabadani (WMF) (talk) 11:38, 23 August 2024 (UTC)[reply]

@JopkeB: I guess I could live with Special:UncategorizedCategories being monthly, though certainly more often is better. Before the most recent report a couple of days ago, we'd gone 6 weeks. When you say "What exactly is the problem?" I'm not sure what you are asking. Are you asking how the page is used? Why it's a problem when it is far out of date? or something else? - Jmabel ! talk 18:55, 23 August 2024 (UTC)[reply]

I have already get an answer to this question: there are technical problems to run lists as frequent as we would like to. JopkeB (talk) 05:14, 24 August 2024 (UTC)[reply]

Could translated pages be hidden from categories?

[edit]

For example see Category:Commons video resources – all those translated pages in that category make it cluttered, hard to go through the pages, and bury pages on the next page.

Would it be possible to hide translated pages so that only one is shown?

  • One could have the links to the translated pages at the top of that page
  • One could automatically open the respective translated page when opening the page depending on one's language settings
  • (and there are more alternatives)

Prototyperspective (talk) 22:00, 16 August 2024 (UTC)[reply]

Much less useful but still useful would be if the translated pages were also hidden or hidable in the File uses on commons section of file pages because it makes it cluttered and hard to see where a file is used; example. Prototyperspective (talk) 10:08, 19 August 2024 (UTC)[reply]
It's possible to place them in a subcategory, but I don't think status quo is necessarily an issue. They should be categorized in any case. Enhancing999 (talk) 12:17, 22 August 2024 (UTC)[reply]
It's currently not a big problem but it makes things far less overseeable and buries things on other pages or beneath the cluttered page. Imagine if there were 300 translated versions of a page which is just roughly the number of languages with a Wikipedia and not even all notable languages, it would make the category barely usable to find and organize things. Manually subcategorizing wouldn't be a good solution because it requires people to spend time manually doing so and new translations will be directly in the category again. Maybe there could be some multilingual redirect page that redirects to whatever language the user has configured if that language version exists and English otherwise? Prototyperspective (talk) 12:35, 22 August 2024 (UTC)[reply]

Vandals often move captions to other languages – detection needed

[edit]

What's going on with people somewhat apparently systematically moving captions from one language to a false one? This is happening frequently and often, if not usually, not detected & reverted by editors.

Could a detection of this please be developed? Compared to other vandalism that is well-detected automatically on Wikipedia by for example ClueBot NG (acc) I think it would be easy to detect if a caption got moved from the original language to another one, if possible with language detection so it also checks if the language it was moved indeed does not match the caption text language. Maybe a better place to ask about this would be the Bot requests or the ClueBot NG talk page / code repo.


Examples

Previously I was wondering how to search my contributions (edit summaries). I was trying to use standardized phrases or terms in edit summaries so I can easily look them up later, for example to replace texts I previously added with templates. I found the following useful tool and because I try to always use term "rvv" when reverting edits that are or seem to most likely be vandalism many examples of such edits can be found here. These are the most recent ones: 1 2 3 4 5 6 7. By the way, I think captions are more a problem or redundant than anything else since there already is the machine-translatable description field (it's useful sometimes when descriptions are long but these could also instead have a short version at the top or be shortened). Another problem is that when undoing changes to captions there is no prefilled edit summary so one has to tediously copy the contributions link of the user and write the edit summary anew. Prototyperspective (talk) 13:04, 17 August 2024 (UTC)[reply]

Given how often this happens (I was able to find a couple of recent instances in a few minutes of looking through RecentChanges) and how weirdly specific of an action it is (it only changes the language of the caption, never its content), I suspect this is a UI/UX issue, not deliberate vandalism, and I suspect it can be addressed by making some changes, like inserting a confirmation dialog when changing the language of an existing caption.
Does anyone know how/where the caption editing interface is implemented, and who's responsible for it? (I also have a couple of gripes about the language picker used in the caption interface - especially its appearance on mobile.) Omphalographer (talk) 05:16, 18 August 2024 (UTC)[reply]
you're right. try File:JPG Test.jpg.
given existing captions, users can change the language and then click publish. that results in the aforementioned problematic edits. RZuo (talk) 05:30, 18 August 2024 (UTC)[reply]
There is the new feature to require captchas for action defined in an abuse filter. I thought about requiring captchas for all IP edits on captions. This might reduce these king of accidental edits. GPSLeo (talk) 05:36, 18 August 2024 (UTC)[reply]
  • Could be but sometimes they do change the text or change multiple languages at once and it seems like it's always only done by new or unregistered users who sometimes did some other different problematic changes.
  • Another thing that could be done is automatic detection of the language and displaying at least a warning or adding the file to a maintenance cat if it doesn't match the specified language – this would also be useful since often people specify the wrong language even at upload.
  • @RZuo: What do you mean? Why would Omphalographer be right in that this is a UI/UX issue? What you described is exactly the expected behavior: changing the language and then clicking publish, how does this suggest it's not vandalism and a UI issue? I don't see why you and GPSLeo think it would be accidental / an UI issue if one has to deliberately click "publish".
  • Even if both captchas are added and they indeed reduce these changes, I still think there should be automatic detection of these changes as well as other likely vandalism. Why is ClueBot_NG not active on WMC? Does ORES work with WMC? There's lots of vandalism here (not just in the captions and the relatively hidden structured data) and I've come across multiple cases where it stayed on a relatively large page for a year or so. Bots/tools could build a queue of edits to check as well as automatically revert edits that are very likely to be vandalism. Moreover, they could learn from edits that specify that the reverted edit was likely vandalism or similarly nonconstructive (it doesn't matter if deliberate or not) if terms like RVV are used.
Prototyperspective (talk) 10:07, 18 August 2024 (UTC)[reply]
People are on a website an they see a language selection. They want to change the language of the text. They do not expect that they are able to change the content on a website where they do not even have an account. The button says publish but how is this translated into different languages and are these translated terms always that clear? GPSLeo (talk) 10:37, 18 August 2024 (UTC)[reply]
This type of "vandalism" is so specific that it would basically have to be a very dedicated LTA who utilizes IPs from all around the world; it's disruptive but I don't think it's deliberate. Gnomingstuff (talk) 04:49, 26 August 2024 (UTC)[reply]
Yes I think the explanation by GPSLeo is quite plausible. The main subject of this thread however is detecting such edits and auto-reverting them (also useful for other unconstructive edits / reducing maintenance workload) regardless of whether or not they are intentional or not. I don't know what you mean by "LTA" but it could have also been many people that found this to be an effective type of vandalism as it's often not detected and reverted but I already think inadvertent edits may be more likely. Maybe there could be some special confirmation box asking if the user really wants to publish that to the file data without using the word "publish" because maybe those users didn't understand that word. In any case, detecting if the text in the caption matches the languages seems useful and needed in any case, for example because many users add captions in English to other languages at upload or similar things. Prototyperspective (talk) 09:52, 26 August 2024 (UTC)[reply]
Something in the GUI needs improvement. It's plausibel that changing the language and saving it actually adds an additional language rather than deleting one as well: [1] Enhancing999 (talk) 10:10, 26 August 2024 (UTC)[reply]
LTA = long-term abuse, people who vandalize in their specific identifiable way for months or years.
The main tells on these are edits by infrequent editors with 2 changes per file. Not much help though once it's out of recent changes. I've found some by searching for captions with mismatched languages e.g. "Spanish the," but obviously that only works with certain patterns.
The problem though with any kind of auto-reverting is that it would have to not catch people fixing this stuff, especially when it's undetected. Gnomingstuff (talk) 11:29, 27 August 2024 (UTC)[reply]
Many don't have 2 changes per file. I think the main indicators are 1. language does not match specified caption language (no other indicator is needed; check language auto-detection of Google Translate or DeepL to see what I mean with language detection) 2. user isn't an editor with many unreverted edits (would only use this indicator early on as language mismatching is a general problem).
I don't understand what you mean with The problem though with any kind of auto-reverting is that it would have to not catch people fixing this stuff. People fixing this stuff would move the caption back to its matching language or remove the flawed captions so they wouldn't be detected. Prototyperspective (talk) 11:51, 27 August 2024 (UTC)[reply]

Geolocate nepal photos

[edit]
Date Name Thumbnail Size Description
09:31, 23 March 2019 Chandragiri Cable Car, 2018-04-21 (2).jpg (file) 5.93 MB User created page with UploadWizard
07:18, 23 March 2019 Chandragiri Cable Car, 2018-04-21 (1).jpg (file) 6.31 MB User created page with UploadWizard

plz take a look at User:Simasuru's photos. i located one of them to 27.685, 85.2138. do you think that's right? once it's confirmed, i will rename the files. RZuo (talk) 05:46, 18 August 2024 (UTC)[reply]

I'd tend to have these tests deleted. Enhancing999 (talk) 12:21, 18 August 2024 (UTC)[reply]
Rename yes, but meets COM:SCOPE, right? --PantheraLeo1359531 😺 (talk) 11:55, 19 August 2024 (UTC)[reply]
@RZuo: it seems to match Google Street View. Of course locating an image retroactively might result in some minor error margins, but it should not be a huge deal. I've moved them to a more appropriate file name now. —Matrix(!) {user - talk? - uselesscontributions} 15:13, 27 August 2024 (UTC)[reply]

Can not find out what causes an error

[edit]

I apologize, first attempt to upload three images: https://commons.wikimedia.org/w/index.php?title=Special:UploadWizard&campaign=CHM-DE-HE&categories=Cultural+heritage+monuments+in+Erbach+%28Odenwald%29&description=Stra%C3%9Fenbr%C3%BCcke%2C+Eisenbahn+%28auf+der+Grenze+zu+Michelstadt%29&descriptionlang=de&fields%5B0%5D=952486&fields%5B1%5D=49.67158%2F8.99068 - I get "Beim Ausfüllen der Formulare sind 2 Fehler aufgetreten. Bitte die Fehler berichtigen und erneut speichern." - 2 Errors occured, please correct and save again - But I have no idea what errors. The only thing red on the page is the browser spellcheck in text fields - How can I find out? Shyof (talk) 15:14, 19 August 2024 (UTC)[reply]

Maybe it's the same problem as this one?: Commons:Village pump#No error message for same file names in Upload Wizard. It should probably be reported at this page instead of here. Prototyperspective (talk) 12:02, 20 August 2024 (UTC)[reply]
Hmm, that could have been possible - uploading the files one-by-one without changing anything else worked (the names differed but had a common prefix)... Thanks for the hint! Shyof (talk) 15:54, 20 August 2024 (UTC)[reply]
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. --廣九直通車 (talk) 10:35, 16 September 2024 (UTC)
[edit]

Hi all, i've noticed that since some days (can't tell exactly :-( ), there's a discrepancy between search results on Commons and PetScan search results based on the very same search. For example, a search on Commons with search string Tschubby map incategory:"Media_missing_infobox_template" currently yields 1,087 hits. Executing exactly the same search on PetScan yields between (!) 1,039 and 1,065 results (I did multiple search runs with same psid). Repeating the search on PetScan yields different results, while the commons search result is stable. This looks like a PetScan issue, but before reporting there, I wonder if others found a similar behaviour, or if I'm doing something wrong. Fl.schmitt (talk) 16:41, 19 August 2024 (UTC)[reply]

I noticed some discrepancies, but they seemed minor. Petscan uses different channels that might fail or occasionally have their own problems. Maybe you want to use the category instead. Enhancing999 (talk) 16:39, 21 August 2024 (UTC)[reply]

Tech News: 2024-34

[edit]

MediaWiki message delivery 00:49, 20 August 2024 (UTC)[reply]

Options to "Use this file" do not appear for some files

[edit]

For some files, the top bar only contains links to the file history and usage. Some examples:

It does not contain the option to download or use the file. What's the reason for this inconsistency? Ixfd64 (talk) 20:30, 20 August 2024 (UTC)[reply]

How to show different contents on mobile?

[edit]

I don't know why nobody is replying at Talk:Main Page#Edit requests but the Main page is referring to the links to the right for years while on mobile, which by now could well be how the majority of people land on this page by now, the links are not "on the right" but down. Is there a way for this part to be different on mobile? Extension:MobileDetect doesn't seem to work on WMC. Prototyperspective (talk) 20:19, 21 August 2024 (UTC)[reply]

You use media queries and the skin classes and fix the templatestyles. —TheDJ (talkcontribs) 18:10, 25 August 2024 (UTC)[reply]
[edit]

With videos it is possible to specify a start (and end) time of the video – see Commons:Video#Temporal media fragments.

  • Is it possible to link to times of the videos in the file description?
  • Is that already possible somehow for audio files?

I think it would be very useful if in the file description of Spoken Wikipedia audio files, like those that I just uploaded, there were links to the different sections of the article. If one is only interested in a particular section of the article one could jump to it directly and listen to only that. It also gives some orientation where one is currently at when listening to articles. Linking to chapters is possible on YouTube and probably many users have already noticed some ways this can be useful. For example, I'd like to link the timestamps in the description here and add timestamp links to the different exercises here.

This doesn't seem to work with videos either as one would have to append ?start=00:26 but can only append things like #start=00:26. Is there some issue about these things? Prototyperspective (talk) 12:46, 22 August 2024 (UTC)[reply]

There isn't really a universal chapter methodology in HTML5 video. The HTML5 idea is that you create your own VTT file (which we don't yet support). These could refer to Commons link (they can be anything you like, as all support for it is completely custom work). Then you write custom Javascript to listen to the timedtext events, read the 'text' (a description and link in this case) and go do something with that (wrapped inside a videojs custom plugin).
Related tickets about this in phabricator. phab:T116154, phab:T301826. —TheDJ (talkcontribs) 18:08, 25 August 2024 (UTC)[reply]
You can link to start points in the file description page using start= and end= query params like:
File:Using_a_Power_Rack_in_bodybuilding,_powerlifting,_strength_training,_resistance_exercise.webm?start=00:10&end=00:15, but it is not that useful, as there is no autoplay etc. —TheDJ (talkcontribs) 18:14, 25 August 2024 (UTC)[reply]
  • The two issues you linked don't seem to be related to this at all. What you wrote is about annotations like a link or a balloon message informing about an inaccuracy in the video at specific times of the video. In contrast this post here is about video chapters and linking to different times of the video in the file description.
  • Yes, I know that one can link to start points that way and explicitly said that in the last paragraph. As said, one would need to link to these from there like #start=00:26 because when including a link like your example, it opens the video in a new tab at that starting point instead of directly jumping to it.
  • This is really important for Spoken Wikipedia where you may like to jump to a particular section. There also needs to be some proper audio player, with the current one only being the fallback, that for example is wider so you can jump to some timing better and with a -10 seconds feature.
Prototyperspective (talk) 10:49, 26 August 2024 (UTC)[reply]

Bot no longer working to warn Wikivoyage about nominations for deletion?

[edit]

As I stated in User talk:IronGargoyle#No notifications to Wikivoyage anymore? (edited for brevity and relevance):

Commons delinker delinked an image on Wikivoyage that was deleted in the Commons:Deletion requests/Files in Category:Patuxai thread. Why was there never a notification on the voy:Talk:Indochina Wars page that this file was nominated for deletion? We're supposed to get such notifications.

Was there a malfunction of the relevant bot, or was a unilateral decision made on Commons to stop giving notice to other sites such as Wikivoyage that use thumbnails of images on Commons in our articles but can choose to locally upload images that we consider important and are problematic merely due to a lack of commercial freedom of panorama? If the bot malfunctioned, please try to find out why and ensure the problem does not recur. However, if a unilateral decision was made to stop giving sister sites such as Wikivoyage the chance to make our own decisions on affected files before they are deleted on this site, I cannot state too strongly that that is absolutely unacceptable! We cannot return to the days when slews of images were deleted from Wikivoyage articles without notice. I'll look forward to your response and explanation of how you will prevent this problem from recurring. Thanks, everyone! -- Ikan Kekek (talk) 04:30, 25 August 2024 (UTC)[reply]

I do not think this is specific to Wikivoyage, there were also bot notifications about imminent deletions in the English Wikipedia, but I do not see them anymore. May the the bot got abandoned, lost the flag or smth else happened. Ymblanter (talk) 18:23, 27 August 2024 (UTC)[reply]
I'm not entirely sure if that wikivoyage was on the bot, however I do know that the bot was recently broken for quite a while: https://phabricator.wikimedia.org/T339145 There was a surprising small amount of communities that adopted/approved that bot when the community tech team finally rewrote it in 2018 btw. Really sad, if you realize how much money the foundation poured into reworking that bot. —TheDJ (talkcontribs) 20:17, 27 August 2024 (UTC)[reply]
We need to be notified when images we use as thumbnails are nominated for deletion. If the bot isn't working, what's the solution? -- Ikan Kekek (talk) 23:03, 27 August 2024 (UTC)[reply]

Croptool connections/authorization

[edit]

Hey guys, I use croptool to correctly size images from Wikipedia commons for use as pagebanner in wikivoyage. Unfortunately, connections have been consistently failing the last couple days. Anyone seen this? Any tips on what I might need to do to get it to work? Mrkstvns (talk) 21:13, 25 August 2024 (UTC)[reply]

See COM:CropTool, the tool should be up now, but it seems like Toolforge had issues over the weekend. Sohom (talk) 14:53, 26 August 2024 (UTC)[reply]

Dark mode fix on Main Page?

[edit]

I've pushed some dark mode fixes on Template:Main Page Template, but it's not being reflected in Main Page for some reason? However it is being reflected in other languages like Tamil (முதற் பக்கம்). Is there a reason why? —Matrix(!) {user - talk? - uselesscontributions} 05:21, 26 August 2024 (UTC)[reply]

Upland Wizard bug

[edit]

I uncovered a bug in the Upload Wizard. Can it be reported somewhere locally, or should I use Phabricator? The Upload Wizard main page and Upload Wizard FAQ are both silent on bug reporting. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:49, 26 August 2024 (UTC)[reply]

The correct place on Wiki would be Commons talk:WMF support for Commons/Upload Wizard Improvements. But in this case this is not an UploadWizard problem. We have an template with the same name as a language code. That is an error that should never happen and we need to delete the template redirect. GPSLeo (talk) 15:07, 26 August 2024 (UTC)[reply]
I moved all uses of the template redirect to the actual template and deleted the redirect. But I am unsure if the language code handling will work without any additional action. GPSLeo (talk) 15:29, 26 August 2024 (UTC)[reply]
Thank you. Did you check whether any of those were genuine attempts at using the language? Looks like we need to replicate {{En}}, {{De}} etc., for the abr language code. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 15:34, 26 August 2024 (UTC)[reply]
Yes, I checked the files before changing the template. 6 files wanted to use the language code. I create the {{Abr}} and it seems to work correct. GPSLeo (talk) 15:44, 26 August 2024 (UTC)[reply]

Subcategory not seen in a category

[edit]

The Category:Books from the United States is categorized in Category:Books by country, but it is nowhere to be seen in that category. What is the reason? -- Jan Kameníček (talk) 16:53, 26 August 2024 (UTC)[reply]

Tech News: 2024-35

[edit]

MediaWiki message delivery 20:28, 26 August 2024 (UTC)[reply]

Addition of Pattypan Userguide on Commons

[edit]

I and a colleague are curating a complete guide for using Pattypan for batch uploads for Wikimedia Commons, from image quality to upload processes to license and creations of hyperlinks copyright approval. I need help understanding how we can upload here on wikimedia for use by everyone Lutarchitecture (talk) 00:44, 27 August 2024 (UTC)[reply]

You could complete Commons:Pattypan?
 ∞∞ Enhancing999 (talk) 15:19, 27 August 2024 (UTC)[reply]
What do you mean by complete Pattypan, please throw more light thanks, like give it more context so I effectively understand you Lutarchitecture (talk) 21:14, 27 August 2024 (UTC)[reply]
That page and its subpage are the guide of Commons for the use of Pattypan.
 ∞∞ Enhancing999 (talk) 21:17, 27 August 2024 (UTC)[reply]

Search operators for audio file size and duration?

[edit]

Because the gallery of new files (linked in the left panel that is shown on all pages under "Latest files") is broken for audio files since it's very cluttered with lots of pronunciation files and not having any way to filter files out within the page, I created this search that to some extent filters out pronunciation files: -intitle:/LL\-Q/ -Pronunciation -deepcategory:"Tamil pronunciation" -deepcategory:"Dutch pronunciation" -deepcategory:"English pronunciation" filetype:audio

There are two problems with it: deepcategory search operator does not work properly so the parent category can't be used (phab:T369808) and it's quite slow because the "intitle" search operator can't deal with "LL-Q" because of the hyphen needing this regex which slows the search down (phab:T371195). It's still good enough that it can be used in practice to see recent audio files that aren't pronunciation files so e.g. audiobooks, spoken Wikipedia, soundscapes, and music tracks as well as lots and lots of copyvios that no human or bot seems to check. To improve it further, I'd like to add a search operators to exclude files with a duration of only a few seconds or a filesize of just a few kB because these are usually also just pronunciation files. It seems like petscan can do so so is there a way to add this to the WMC search? If such search operators don't exist is there some phabricator issue about adding these? They would also be useful in finding for example subcategorize Short films videos in Films categories and many other things. Prototyperspective (talk) 10:16, 28 August 2024 (UTC)[reply]

@Prototyperspective: You can search by file size in kilobytes using filesize:. See mw:Help:CirrusSearch#File properties search. But there doesn't seem to be a property for duration. You can see what the search system can use by appending ?action=cirrusdump to a page URL, and trying that on a random audio file I can't see anything that looks like the duration. It seems to me that duration would be an obvious measure alongside height and width, but I'm not sure where I'd suggest that. --bjh21 (talk) 11:22, 28 August 2024 (UTC)[reply]
Ups, thanks apparently I only ctrl+F searched for "file size" with a space but not "filesize" on that help page. I added filesize:>100 to the search string.
I'll make a phabricator issue asking about a search operator for duration then because even if that's not needed for filtering out pronunciation audio files it would still be useful for finding short films and many other use-cases I haven't thought of. Prototyperspective (talk) 12:54, 28 August 2024 (UTC)[reply]
This ?action=cirrusdump thing seems very useful so I think it should be included in the help page properly, could you add it there (currently it's only buried in the notes of some reference). Another issue is that I can't find out how to specify the sort order with the search operators (asked about it at Template talk:Search link) – how could one specify sort=create_timestamp_desc as sort order? Prototyperspective (talk) 13:00, 28 August 2024 (UTC)[reply]

How to distinguish categories set by the Infobox from other cats?

[edit]

It would be great if somebody could create a report that lists categories that don't have any categories except for meta-categories set by the Wikidata Infobox like Category:Uses of Wikidata Infobox with no image as proposed here. Example

Is there some way to distinguish categories set by the Infobox from other categories?

It would be best if one could also distinguish between meta-categories set by the WD Infobox like the one above and other categories set by the WD Infobox like Category:Lonnie (given name) (from example) because some categories that only have categories set by the WD Infobox don't need any other categories so would best show up in some separate less important report at some point (and it may even be desirable to increase cats that only have cats set by their WD Infobox). Prototyperspective (talk) 19:47, 30 August 2024 (UTC)[reply]

Cat-a-lot performance, maintenance

[edit]

Cat-a-lot seems very slow, since a few days. For example It takes 8 mins to edit a batch of 500 files (locking the tab). Can this be confirmed to be a server or a scripting issue and checked and fixed for speed. rollback ? It should not be my bandwith, but maybe advice on a local setting? Thank you Peli (talk) 12:21, 1 September 2024 (UTC)[reply]

I have the same experience. Very slow. Wouter (talk) 12:51, 1 September 2024 (UTC)[reply]
Please see this thread. Prototyperspective (talk) 21:59, 1 September 2024 (UTC)[reply]

Tech News: 2024-36

[edit]

MediaWiki message delivery 01:02, 3 September 2024 (UTC)[reply]

upload wizard for books

[edit]

Is there a campaign interface (Upload Wizard configuration) that fills in {{Book}} instead of {{Information}}?

It could make it easier for people to understand files like the ones in the Chinese categories.
 ∞∞ Enhancing999 (talk) 12:30, 3 September 2024 (UTC)[reply]

Audio of music contain copyvio thumbnails

[edit]

The thumbnails are not showing up at the audio file but the thumbnail is embedded in them. However, they are embedded in the file and when downloading the file one can see or extract them. Example.

  1. Many of these thumbnails are copyrighted. This means usually the thumbnail would need to be removed. video2commons already imports audio files without the thumbnails. Could there be some script or bot that categorized all audio files with a thumbnail set into e.g. Category:Audio files with embedded thumbnail?
  2. Then as a next step one could remove all of them at scale and efficiently using some metadata removal tool, for example similar to command eyeD3 --remove-all-images **/*.opus (applied to all audio files in some category). I guess it would be best to not remove the thumbnail for identified cases where the thumbnail is CCBY as well, these could e.g. be moved to another category or audio files whose thumbnails should be removed to a subcategory of the category above. (A more sophisticated method would be to reverse image search each thumbnail for finds via tineye so only non-original works are deleted and thumbnails created by the person licensing the work under CCBY kept (if the CCBY license also applies to the thumbnail) but I don't think this would be necessary as it would cause a lot of manual work of checking whether it's indeed a copyvio and whether thumbnails without reverse search result are indeed not copyvios.)

Just as a note: the audio files of the example display 0:00 as duration instead of the duration which only shows after one has clicked play. Prototyperspective (talk) 00:07, 4 September 2024 (UTC)[reply]

When removing the thumbnail one could replace it with a link that enables people to easily download the thumbnail again from some metadata provider. So they should just contain a link or an ID with which to fetch the thumbnail but not a thumbnail image. Prototyperspective (talk) 15:12, 6 September 2024 (UTC)[reply]

A bot that moves categories to the bottom of the page

[edit]

Many of my files have the categories in the description. There’s too many to manually move, but is there a way to move them to the bottom with a bot as is done on Wikipedia? Immanuelle ❤️💚💙 (please tag me) 07:37, 8 September 2024 (UTC)[reply]

Example? Prototyperspective (talk) 09:53, 8 September 2024 (UTC)[reply]
@Immanuelle: You can almost certainly use COM:VFC to do it, but I'd suggest not bothering. The categories will work properly wherever they are, so no-one will care unless they're actually reading the wikitext. And if they're reading the wikitext they can fix it themselves. --bjh21 (talk) 11:11, 15 September 2024 (UTC)[reply]

Tech News: 2024-37

[edit]

MediaWiki message delivery 18:48, 9 September 2024 (UTC)[reply]

Troubleshooting needed for File:AMD Zen.svg

[edit]

Can someone investigate why the SVG graphic image File:AMD Zen.svg suddenly stopped working? On the Wikipedia pages where it's used, it's just a blank grey image, and if I click on it, it says "Sorry, the file cannot be displayed - There seems to be a technical issue. You can retry if it persists. Error: could not load image from https://upload.wikimedia.org/wikipedia/commons/thumb/9/9f/AMD_Zen.svg/800px-AMD_Zen.svg.png". Then when I go to Commons where it's hosted, I just see a link to "File:AMD Zen.svg" in place of where the image should be. Opening the link brings me to a page with the error "XML Parsing Error: prefix not bound to a namespace", and trying to open a lower-resolution render just results in a random WMF error like "server technical issue" or "Too many requests, try again later".

Obviously I've had a look at the file upload and page history for this item and there doesn't seem to be any recent changes (or vandalism) that could have caused this to happen. AP 499D25 (talk) 07:53, 13 September 2024 (UTC)[reply]

no xlink namespace declaration. We will need to wait for Commons image scalars to quiet down. Glrx (talk) 20:03, 13 September 2024 (UTC)[reply]
Now it's working again. Kinda bizarre that the other similar-looking files File:AMD Threadripper.svg and File:AMD Epyc.svg were still working at the time, which led me to think that perhaps there was a programming error or some code change that broke File:AMD Zen.svg. AP 499D25 (talk) 03:12, 14 September 2024 (UTC)[reply]

For some reason when I attempted to open File:Typhoon-Yagi 5.jpg, I get nothing but File not found: /v1/AUTH_mw/wikipedia-commons-local-public.d5/d/d5/Typhoon-Yagi_5.jpg. Neither switching browser nor clearing cache help the problem. Initially I thought the file was broken, but Túrelio informed me that he can access the file without problem.

From the archives there appears that there are 2 similar problems. One in February 2022 was resolved by clearing cache, while another in August 2022 ended up in Phabricator. I'd like to ask are there anyone having similar problems, and should I report the matter to Phabricator? Many thanks.廣九直通車 (talk) 13:02, 14 September 2024 (UTC)[reply]

Works perfectly well to me. Are you still getting the error message? — Alien  3
3 3
14:01, 14 September 2024 (UTC)[reply]
Well I just clicked on it and I'm getting the same error myself too! It must be related to my posting about File:AMD Zen.svg above. AP 499D25 (talk) 14:22, 14 September 2024 (UTC)[reply]
Maybe a browser issue? I'm on Firefox, and you? — Alien  3
3 3
14:30, 14 September 2024 (UTC)[reply]
i can also access it, both file page and original file https://upload.wikimedia.org/wikipedia/commons/d/d5/Typhoon-Yagi_5.jpg . using firefox on windows 11. RZuo (talk) 15:46, 14 September 2024 (UTC)[reply]
  • I'm on Win 11. File page does not display image with Chrome or Edge. Loading directly in Chrome, I get "File not found: /v1/AUTH_mw/wikipedia-commons-local-public.d5/d/d5/Typhoon-Yagi_5.jpg" out of the cache.
access-control-allow-origin: *
access-control-expose-headers: Age, Date, Content-Length, Content-Range, X-Content-Duration, X-Cache
age: 480
content-length: 85
content-type: text/html; charset=UTF-8
date: Sat, 14 Sep 2024 17:58:37 GMT
nel: { "report_to": "wm_nel", "max_age": 604800, "failure_fraction": 0.05, "success_fraction": 0.0}
report-to: { "group": "wm_nel", "max_age": 604800, "endpoints": [{ "url": "https://intake-logging.wikimedia.org/v1/events?stream=w3c.reportingapi.network_error&schema_uri=/w3c/reportingapi/network_error/1.0.0" }] }
server: envoy
server-timing: cache;desc="hit-front", host;desc="cp4052"
strict-transport-security: max-age=106384710; includeSubDomains; preload
timing-allow-origin: *
x-cache: cp4052 miss, cp4052 hit/4
x-cache-status: hit-front
x-content-type-options: nosniff
File page and JPEG display with Firefox 130.0 (64-bit).
Glrx (talk) 18:12, 14 September 2024 (UTC)[reply]
Different users get (or don't get) the same file from different servers?
 ∞∞ Enhancing999 (talk) 23:58, 14 September 2024 (UTC)[reply]
Update: tried Safari on iOS, also failed. Probably best to be dealt on Phabricator?廣九直通車 (talk) 06:58, 15 September 2024 (UTC)[reply]
Phabricator bug report filed at phab:T374773, FYI.廣九直通車 (talk) 07:19, 15 September 2024 (UTC)[reply]
Johannnes89 on Phabricator reported that he has no problem in accessing the file with Chrome and Safari, presumably in his home in Germany. Like to ask where did you access the file?廣九直通車 (talk) 09:24, 15 September 2024 (UTC)[reply]
Yes I accessed it from Germany indeed, so the issue might be about accessing it from different servers. Johannnes89 (talk) 09:37, 15 September 2024 (UTC)[reply]
Just tried it from Edge, and it worked.
accept-ranges: bytes
access-control-allow-origin: *
access-control-expose-headers: Age, Date, Content-Length, Content-Range, X-Content-Duration, X-Cache
age: 0
content-length: 7434740
content-type: image/jpeg
date: Sun, 15 Sep 2024 14:55:18 GMT
etag: fe68fa2d2c9fb9101db078cb263815cb
last-modified: Fri, 13 Sep 2024 09:57:44 GMT
nel: { "report_to": "wm_nel", "max_age": 604800, "failure_fraction": 0.05, "success_fraction": 0.0}
report-to: { "group": "wm_nel", "max_age": 604800, "endpoints": [{ "url": "https://intake-logging.wikimedia.org/v1/events?stream=w3c.reportingapi.network_error&schema_uri=/w3c/reportingapi/network_error/1.0.0" }] }
server: envoy
server-timing: cache;desc="miss", host;desc="cp1115"
strict-transport-security: max-age=106384710; includeSubDomains; preload
timing-allow-origin: *
x-cache: cp1115 miss, cp1115 miss
x-cache-status: miss
x-content-type-options: nosniff
x-object-meta-sha1base36: l1h10jxvtd5o73z4q51fcqsot4fy2wu
Glrx (talk) 14:59, 15 September 2024 (UTC)[reply]
server cp1115 seems to have the file, but not cp4052
 ∞∞ Enhancing999 (talk) 15:07, 15 September 2024 (UTC)[reply]
As of 16:00 UTC+8, I can now access the file without problem in Hong Kong. Will like to hear if anyone elsewhere still has trouble in accessing the file?廣九直通車 (talk) 08:14, 16 September 2024 (UTC)[reply]
Now that the task has been resolved on Phabricator, I think it's time to resolve and archive this thread. Thanks for all of your comments.廣九直通車 (talk) 10:33, 16 September 2024 (UTC)[reply]
Curious how often this happens. Apparently there is a weekly process to fix it, see phab:T374773#10147831.
 ∞∞ Enhancing999 (talk) 10:43, 16 September 2024 (UTC)[reply]

Reupload crashed midway

[edit]

On Poems Betham p9.jpg. I redid the colors, tried to reupload it, it lagged for a few minutes then crashed. A new version of the file has been added to the upload history, but the file itself is still exactly the same (including after purge), and when I try to reupload the corrected version, it gets refused as a duplicate of the "current version" of the file, which it is not. What should I do? — Alien  3
3 3
13:59, 14 September 2024 (UTC)[reply]

Undid the upload, redid the upload, all good now. Whatever... — Alien  3
3 3
14:54, 14 September 2024 (UTC)[reply]
Caching issue? I did see two different files when there were just two versions.
 ∞∞ Enhancing999 (talk) 15:43, 14 September 2024 (UTC)[reply]
No, purged everything twice, still didn't work. Once I undid and redid it, though, it started pretending that the first try worked. — Alien  3
3 3
15:52, 14 September 2024 (UTC)[reply]