Add mochila_images.s3.amazonaws.com to the wgCopyUploadsDomains whitelist
Closed, ResolvedPublic

Description

Author: ayaita17

Description:
Please add the following domain(s) to the wgCopyUploadsDomains whitelist:

This is an example:

https://www.dropbox.com/sh/qziqxh4fs2id69d/AACf_EBbByO6-GeoZbfRotr3a/10_LagoAlajuela.jpg


Version: wmf-deployment
Severity: normal

Details

Reference
bz67344

Event Timeline

bzimport raised the priority of this task to Needs Triage. (Nov 22 2014, 3:39 AM)
bzimport set Reference to bz67344.
bzimport added a subscriber: Unknown Object (MLST).

What? Dropbox? We're not going to add that. What exactly do you want this for?

ayaita17 wrote:

Why do you want to know if you are already saying you are not adding it?

I am not a super tech savvy user and don't know what you mean by "what? Dropbox?"

I am trying to use the GWToolset to mass upload pictures to Commons. We don't have the pictures online, and the only alternative we found was to upload them to a Dropbox folder. Then, when using the GWToolset, we noticed the whitelist of domains, which led me to make this request, as suggested here:
http://gwtoolset.wmflabs.org/wiki/Special:GWToolset

You'll need to find some other place to upload to. A good option may be an Amazon S3 bucket. It needs a dedicated domain name that no other user can upload content to.

Or else why even bother maintaining a whitelist at all? Dropbox would defeat the purpose.

Anyway, also answer the other question. What is this for?

Are you uploading on behalf of an organization? Which one?

What is the source of these images? Did you take them yourself?

ayaita17 wrote:

It's an institutional project, and the pictures are from different institutions that have released them with CC-BY and CC-BY-SA. We are still working on the templates but it's more or less described here:
https://commons.wikimedia.org/wiki/Commons:AlmanaqueAzul

ayaita17 wrote:

Once I use the S3 bucket, do I need to file another bug report to add the bucket's domain to the whitelist?

Just comment on this bug with the bucket's name/domain.

(In reply to Ayaita from comment #2)

Why do you want to know if you are already saying you are not adding it?

I wanted to know so I would be able to suggest an alternative. I see that Jeremy has already done that.

I am not a super tech savvy user and don't know what you mean by "what?
Dropbox?"

Dropbox is a filesharing service usable by anyone. (I was assuming you knew this, since you included a Dropbox URL in the bug description.) Anyone can upload there, so it makes no sense to add it to our whitelist.
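To illustrate the point, here is a minimal sketch of a host-based whitelist check. It is not MediaWiki's actual implementation; the function name `is_allowed` and the host `example-bucket.s3.amazonaws.com` are hypothetical, and exact host matching is an assumption made for simplicity. Because the check only looks at the hostname, listing a shared host such as www.dropbox.com would admit files from every user of that service.

```python
from urllib.parse import urlsplit

# Hypothetical whitelist: one host dedicated to a single uploader.
ALLOWED_HOSTS = {"example-bucket.s3.amazonaws.com"}


def is_allowed(url: str, allowed_hosts=ALLOWED_HOSTS) -> bool:
    """Return True if the URL's host is exactly one of the whitelisted hosts."""
    host = urlsplit(url).hostname  # lowercased by urlsplit; None if missing
    return host is not None and host in allowed_hosts


# A dedicated bucket host passes the check.
print(is_allowed("http://example-bucket.s3.amazonaws.com/photo.jpg"))
# A shared host is rejected: the path is the only thing that identifies
# the uploader, and a host-based check cannot see it.
print(is_allowed("https://www.dropbox.com/sh/some-folder/photo.jpg"))
```

The check has no way to tell one Dropbox folder from another, which is why only a domain controlled by a single uploader is useful here.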

ayaita17 wrote:

(In reply to This, that and the other from comment #9)

(In reply to Ayaita from comment #2)

Why do you want to know if you are already saying you are not adding it?

I wanted to know so I would be able to suggest an alternative. I see that
Jeremy has already done that.

It was not clear that your "what? dropbox?" answer meant that. Please be more aware of different types of users next time.

I am not a super tech savvy user and don't know what you mean by "what?
Dropbox?"

Dropbox is a filesharing service usable by anyone. (I was assuming you knew
this, since you included a Dropbox URL in the bug description.) Anyone can
upload there, so it makes no sense to add it to our whitelist.

You were assuming things and I assumed things too. See, in that whitelist Flickr is listed, and to me, Flickr is a filesharing service usable by anyone.

(In reply to Ayaita from comment #10)

See, in that whitelist
Flickr is listed, and to me, Flickr is a filesharing service usable by
anyone.

AIUI, there is one shared whitelist for multiple uses.

IIRC, Flickr was on that list long before there was a GWToolset. (I think for a while Flickr was the only member of the list.) Flickr is much different from Dropbox for 2 reasons (in general, not specifically in relation to GWToolset):

  • there are no arbitrary file uploads; files must be one of a few limited types of media (right?) and the site provides methods for recording various metadata on a per-file basis (including license)
  • we have processes specifically for dealing with Flickr, including a bot that follows up on Flickr uploads soon after upload and verifies that the metadata at Flickr matches the file description page. AFAIK, no such comparison would be possible with Dropbox; there would be nothing to compare against.

GWToolset may also have some hardcoded special casing for Flickr; I'm not sure.

ayaita17 wrote:

Hi, we finally uploaded the pictures to an S3 bucket. This is an example URL:

http://s3.amazonaws.com/mochila_images/02050019.jpg

The bucket's name is:
mochila_imagenes_import_1

I would appreciate it if this could be added to the GWToolset whitelist.

(In reply to Ayaita from comment #12)

http://s3.amazonaws.com/mochila_images/02050019.jpg

So you can use:
http://mochila_images.s3.amazonaws.com/02050019.jpg

The bucket's name is:
mochila_imagenes_import_1

So, now I'm confused. Is it mochila_images or mochila_imagenes_import_1 ?
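For reference, the rewrite above (from the shared s3.amazonaws.com host to a bucket-specific hostname) can be sketched roughly as follows. The helper name `to_bucket_host_url` is hypothetical, and the sketch assumes the first path segment is the bucket name, as in the example URL from comment #12.

```python
from urllib.parse import urlsplit, urlunsplit

S3_ENDPOINT = "s3.amazonaws.com"


def to_bucket_host_url(url: str) -> str:
    """Rewrite http://s3.amazonaws.com/<bucket>/<key>
    as http://<bucket>.s3.amazonaws.com/<key>."""
    parts = urlsplit(url)
    if parts.hostname != S3_ENDPOINT:
        raise ValueError(f"not a path-style S3 URL: {url}")
    # Assumption: the first path segment is the bucket, the rest is the key.
    bucket, sep, key = parts.path.lstrip("/").partition("/")
    if not bucket or not sep:
        raise ValueError(f"no bucket/key in path: {url}")
    new_host = f"{bucket}.{S3_ENDPOINT}"
    return urlunsplit((parts.scheme, new_host, "/" + key, parts.query, parts.fragment))


print(to_bucket_host_url("http://s3.amazonaws.com/mochila_images/02050019.jpg"))
# prints: http://mochila_images.s3.amazonaws.com/02050019.jpg
```

The bucket-specific form matters for the whitelist because the hostname alone then identifies the bucket, whereas s3.amazonaws.com by itself is shared by all buckets.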

ayaita17 wrote:

(In reply to jeremyb from comment #13)

(In reply to Ayaita from comment #12)

http://s3.amazonaws.com/mochila_images/02050019.jpg

So you can use:
http://mochila_images.s3.amazonaws.com/02050019.jpg

The bucket's name is:
mochila_imagenes_import_1

So, now I'm confused. Is it mochila_images or mochila_imagenes_import_1 ?

I'm sorry, my bad.

mochila_imagenes_import_1 is the name of the main bucket. Inside it, we created two buckets of images (and plan to add more later to better control different groups of images):
mochila_images
mochila_images2

The example url sent before is inside mochila_images

I hope this clarifies the request.

ayaita17 wrote:

Is there any other question about this? I really need to proceed with our image upload.

Please tell me if you need anything else to add our Amazon S3 URL to the whitelist.

Change 145144 had a related patch set uploaded by Steinsplitter:
Adding mochila_images.s3.amazonaws.com and mochila_images2.s3.amazonaws.com temporary to wgCopyUploadsDomains for GWToolset upload.

https://gerrit.wikimedia.org/r/145144

Change 145144 abandoned by Ori.livneh:
Adding mochila_images.s3.amazonaws.com and mochila_images2.s3.amazonaws.com temporary to wgCopyUploadsDomains for GWToolset upload.

https://gerrit.wikimedia.org/r/145144

Change 145273 had a related patch set uploaded by Steinsplitter:
Adding new domains to wgCopyUploadsDomains.

https://gerrit.wikimedia.org/r/145273

Change 145273 merged by jenkins-bot:
Adding new domains to wgCopyUploadsDomains.

https://gerrit.wikimedia.org/r/145273