They say "don't build toy models with kaggle datasets" scrape the data yourself /u/01jasper CSCQ protests reddit

And I ask, HOW? every website I checked has ToS / doesn’t allowed to be scraped for ML model training.

For example, scraping images from Reddit? hell no, you are not allowed to do that without EACH user explicitly approve it to you.

Even if I use hugging face or Kaggle free datasets.. those are not real – taken by people – images (for what I need). So massive, rather impossible augmentation is needed. But then again…. free dataset… you didn’t acquire it yourself… you’re just like everybody…

I’m sorry for the aggressive tone but I really don’t know what to do.

submitted by /u/01jasper
[link] [comments]

r/cscareerquestions And I ask, HOW? every website I checked has ToS / doesn’t allowed to be scraped for ML model training. For example, scraping images from Reddit? hell no, you are not allowed to do that without EACH user explicitly approve it to you. Even if I use hugging face or Kaggle free datasets.. those are not real – taken by people – images (for what I need). So massive, rather impossible augmentation is needed. But then again…. free dataset… you didn’t acquire it yourself… you’re just like everybody… I’m sorry for the aggressive tone but I really don’t know what to do. submitted by /u/01jasper [link] [comments]

And I ask, HOW? every website I checked has ToS / doesn’t allowed to be scraped for ML model training.

For example, scraping images from Reddit? hell no, you are not allowed to do that without EACH user explicitly approve it to you.

I’m sorry for the aggressive tone but I really don’t know what to do.

submitted by /u/01jasper
[link] [comments]

They say “don’t build toy models with kaggle datasets” scrape the data yourself /u/01jasper CSCQ protests reddit

Leave a Reply Cancel reply