For the past few months, I’ve been receiving files to process. Unfortunately, I have no control over the layout and haven’t gotten any indication that it’s going to change. My boss put together a data flow that gives us the output we need, but it requires some manual editing on my part every week. This isn’t a big deal, but we’ve been automating a lot of processes lately, and I get the feeling this one might be next on the list.
The problem is that the columns change every week. One week, it might be Orientation columns 1, 2, 3, Overtime columns 1, 2, 3, and Regular Hour columns 1, 2, 3. Another week, it could just be Overtime column 1 and Regular Hour columns 1, 2, 3. For our purposes, all we care about are Orientation 1, 2; Overtime 1, 2, 3; and Regular Hour 1, 2, 3. Additionally, some of the columns are appended to show hours and wages. So every week I have to delete or add columns by hand to get the file ready, and the edits are never quite the same.
I’ve used Python before, but only for small data analytics projects, so I’m not sure where to start researching a solution. Any suggestions? Should I be taking a route other than Python?
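To make the goal concrete, the weekly cleanup could be sketched roughly like this. It is only a sketch under assumptions: it treats the file as CSV (the real files may be Excel), the header names in `WANTED` are guesses at how the columns are labelled, and `normalize` / `normalize_csv` are made-up helper names. It uses only the standard library and does not handle the hours/wages variants.

```python
import csv
import io

# Hypothetical header names; the real files may label these differently.
WANTED = (
    [f"Orientation {i}" for i in (1, 2)]
    + [f"Overtime {i}" for i in (1, 2, 3)]
    + [f"Regular Hour {i}" for i in (1, 2, 3)]
)


def normalize(rows, wanted=WANTED, fill=""):
    """Reduce each row to the wanted columns, in a fixed order.

    Columns missing from this week's file are filled with `fill`;
    columns that are not in `wanted` are dropped.
    """
    return [{name: row.get(name, fill) for name in wanted} for row in rows]


def normalize_csv(text):
    """Take CSV text with any mix of columns and return CSV text
    that always has exactly the WANTED columns."""
    reader = csv.DictReader(io.StringIO(text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=WANTED, lineterminator="\n")
    writer.writeheader()
    writer.writerows(normalize(reader))
    return out.getvalue()
```

The idea is that the list of wanted columns is fixed once, and each week's file is reshaped to match it, so extra columns disappear and missing ones show up empty instead of needing manual edits.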
example: https://imgur.com/a/WjQRBRS
submitted by /u/Busy_You4579
r/learnpython