guidebook_event: 28625726
id | guide_id | import_id | name | description | image | startTime | endTime | allowRating | addToSchedule | deleted | last_updated | locations | tracks | rank | links | allday | max_capacity | registered_attendees | waitlist | require_login | registration_start_date | cog_details | session_discussion_posting_disabled |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
28625726 | 191394 | Python: Working with PDFs using pdfplumber | <p>Wonderful tools such as Tabula have made it easier to extract tabular data from PDFs. But what if your pile of PDFs is more complex than that? Maybe there are a few bits of info that you need to grab outside the tables, or maybe the information isn't tabular at all?<br><br>In this session, we'll use pdfplumber, an open-source Python library, to demonstrate some techniques. We'll also demystify some aspects of the PDF file format, which will come in handy no matter what tools you use.<br><br>This session would be good for: People with some prior experience using Python.</p> | 2023-03-03 10:15:00 | 2023-03-03 11:15:00 | 0 | 1 | 4296416 | 579485 | 84 | [ { "categoryTitle": "Speakers", "links": [ { "title": "Jeremy Singer-Vine", "label": "The Data Liberation Project", "image": "", "gb_url": "gb://guide/191394/poi/?id=15476281", "target_object_id": 15476281, "target_content_type": "custom_list.customlistitem" } ] }, { "categoryTitle": "Forms", "links": [ { "title": "NICAR23 Session Evaluation", "label": "Please complete this session evaluation", "image": "", "gb_url": "gb://guide/191394/survey/?id=20962&object_id=28625726&content_type=schedule.session", "target_content_type": "survey.survey" } ] } ] | 0 | -1 | -1 | 0 | 0 | [] | 0 |