Now at first this is probably going to seem like, well... the kind of thread that will probably be locked after a couple of answers and thrown out with cries of RTFM, GOOGLE IT!!!, etc...
I am not quite sure why, but it seems to have that texture as it's forming in my head.
It's also possibly the fact that this is so OT to the general topic of this forum, on top of it being more of a GIVE ME CODE/HELP! question than a "can you help me with this implementation" question...
But I'm giving you a sour taste in the mouth before you've even read this; everything will soon become clear.
My problem is simply that I need a very large database of music to be able to conduct my experiment.
The first obstacle is to write a bot that can find pages with a certain number of views, certain tags, a certain number of people who have rated the music etc... and of course recover this data.
This I will deal with myself.
The only problem I have now is finding the easiest way to then retrieve the video and strip out the audio for my use.
I wasn't able to find any open-source software that does this, or any documentation.
So I guess my question boils down to this: does anybody have any clues, information or anything else on how I could write something to retrieve videos from YouTube? I am aware that there is already software out there that does this, but I want something that lets me do all my processing before anything is written to disk. I need to process massive amounts of data, so I want it to be as quick and efficient as possible; I don't want to call some software that writes the file to disk and then have to load it off the disk again to do my processing.
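To make the "process it before it ever touches the disk" requirement concrete, here is a rough Python sketch. It is only an illustration under assumptions: `process_stream` and `handle_chunk` are hypothetical names, and the in-memory `io.BytesIO` object stands in for a real network response (any object with a `.read()` method, such as what `urllib.request.urlopen()` returns, could be passed instead). It says nothing about how YouTube itself serves video.

```python
import io
from typing import BinaryIO, Callable


def process_stream(
    source: BinaryIO,
    handle_chunk: Callable[[bytes], None],
    chunk_size: int = 64 * 1024,
) -> int:
    """Read `source` chunk by chunk, hand each chunk to `handle_chunk`,
    and return the total number of bytes read. Nothing is written to disk."""
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:  # an empty bytes object signals end of stream
            break
        handle_chunk(chunk)
        total += len(chunk)
    return total


# Stand-in for a download: 150,000 bytes held purely in memory.
fake_download = io.BytesIO(b"\x00" * 150_000)

# Hypothetical "processing" step: here we simply collect the chunks.
chunks_seen = []
total_bytes = process_stream(fake_download, chunks_seen.append)
```

The point of the design is that the processing step sees the data as it arrives, so the only thing that ever needs to reach the disk is whatever the processing produces.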
Now comes the BIG caveat, which will have all your heads nodding: it's not that I'm not interested, but I am pretty single-minded about this project, and I don't really want to have to learn much about a part that is theoretically irrelevant (though critical) to what I'm doing. Of course it is something that in time I'd be interested in looking at in more depth, but right now... not really...
I'm sure that many of you can understand this, as there are many more important, more interesting things that I need to learn for this project.
Thanks in advance,
Jules
P.S. I think it's clear, but my experience with any form of networking in the context of coding is almost equal to zero...
P.P.S. Does anybody know whether I would be excused from any licensing/copyright issues, as this is use of data for academic purposes?
sample collection
Re: sample collection
http://last.fm
Massive database of songs, statistics, etc... Probably easier than doing whatever it is you're trying to do with videos?
For example, this song has 62,411 plays at the time of posting.
Actually obtaining the music is another story, and even though this idea is for academic reasons you will have a hard time justifying grabbing that much music and storing it locally.
Re: sample collection
Wow, nobody's started shouting yet...
The problem is for a given song I need to be able to have this data:
-tags (to know what style of music)
-number of views
-number of votes for rating.
-rating (in the case of YouTube, out of 5)
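As a rough illustration of the per-song data listed above, the record and the selection step could be sketched in Python like this. Everything here is an assumption made for the example: the names `SongRecord` and `select_songs`, the field names, the thresholds and the sample records are all hypothetical, and the sketch does not cover how the data would actually be collected.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SongRecord:
    """One song's metadata, mirroring the four items in the list above."""
    title: str
    tags: List[str] = field(default_factory=list)  # style of music
    views: int = 0                                  # number of views
    rating_votes: int = 0                           # number of votes for the rating
    rating: float = 0.0                             # average rating, out of 5


def select_songs(records, required_tag, min_views, min_votes):
    """Keep only records carrying the tag and meeting both thresholds."""
    return [
        r for r in records
        if required_tag in r.tags
        and r.views >= min_views
        and r.rating_votes >= min_votes
    ]


# Made-up sample data, purely for demonstration.
records = [
    SongRecord("Song A", ["rock"], views=120_000, rating_votes=900, rating=4.5),
    SongRecord("Song B", ["rock"], views=300, rating_votes=4, rating=5.0),
    SongRecord("Song C", ["jazz"], views=80_000, rating_votes=650, rating=4.1),
]

selected = select_songs(records, required_tag="rock", min_views=10_000, min_votes=100)
selected_titles = [r.title for r in selected]
```

Filtering on the number of votes as well as the rating itself avoids treating a song with a handful of five-star votes as more reliable than one rated by hundreds of people.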
But that wasn't really what my question was about.
The most important piece of data here is the song itself (although without the rest it's pretty useless).
So my question is: where can I find the documentation/info necessary to retrieve a video from YouTube? (It seems to be the only website with the volume of data required as well as the stats I need.)
Thanks for the help though!
(Don't want to appear rude as I haven't been around for a while...)
Thanks in advance,
Jules
Re: sample collection
I built a small Python app (which I later ported to Chicken Scheme) that downloaded YouTube videos. It's not hard at all. Your only real limit is storage.
suthers wrote: Wow, nobody's started shouting yet...
The problem is for a given song I need to be able to have this data:
-tags (to know what style of music)
-number of views
-number of votes for rating.
-rating (in the case of YouTube, out of 5)
But that wasn't really what my question was about.
The most important piece of data here is the song itself (although without the rest it's pretty useless).
So my question is: where can I find the documentation/info necessary to retrieve a video from YouTube? (It seems to be the only website with the volume of data required as well as the stats I need.)
Thanks for the help though!
(Don't want to appear rude as I haven't been around for a while...)
Thanks in advance,
Jules