How to approach parasing large JSON datasets? /u/BigBootyBear Python Education

I want to create a word frequency table of Epicurious – Recipes in kaggle. Specifically the word freqeuncy of the recipe directions text. Now i’m a web developer, so up until this point i’ve only worked with small JSON objects. I’ve done print(json_data.shape) (I assume I need to know the schema to traverse the object) and only got the top object’s schema.

Suddenly i’ve realized I can’t just print(json_data.shape) for the entire object cause JSON is unsafe and I’d have N of keys for N nodes within the file (20k nodes). Now i’m confused and i’d like some help with how I should think about the problem and how I should approach it.

submitted by /u/BigBootyBear
[link] [comments]

​r/learnpython I want to create a word frequency table of Epicurious – Recipes in kaggle. Specifically the word freqeuncy of the recipe directions text. Now i’m a web developer, so up until this point i’ve only worked with small JSON objects. I’ve done print(json_data.shape) (I assume I need to know the schema to traverse the object) and only got the top object’s schema. Suddenly i’ve realized I can’t just print(json_data.shape) for the entire object cause JSON is unsafe and I’d have N of keys for N nodes within the file (20k nodes). Now i’m confused and i’d like some help with how I should think about the problem and how I should approach it. submitted by /u/BigBootyBear [link] [comments] 

I want to create a word frequency table of Epicurious – Recipes in kaggle. Specifically the word freqeuncy of the recipe directions text. Now i’m a web developer, so up until this point i’ve only worked with small JSON objects. I’ve done print(json_data.shape) (I assume I need to know the schema to traverse the object) and only got the top object’s schema.

Suddenly i’ve realized I can’t just print(json_data.shape) for the entire object cause JSON is unsafe and I’d have N of keys for N nodes within the file (20k nodes). Now i’m confused and i’d like some help with how I should think about the problem and how I should approach it.

submitted by /u/BigBootyBear
[link] [comments] 

Leave a Reply

Your email address will not be published. Required fields are marked *