Fix how quotes are handled in guess phase #98
Codecov Report
@@            Coverage Diff             @@
##           master      #98      +/-   ##
==========================================
+ Coverage   74.76%   75.68%   +0.92%
==========================================
  Files          11       11
  Lines        1153     1197      +44
==========================================
+ Hits          862      906      +44
  Misses        291      291
Is there any hope of using the existing quoted String token parser with I think one can just parse quoted stringtoken fields until a field ends exactly at a newline. There's no need to getrowend or even quotedsplit.
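To make the suggestion above concrete, here is a rough sketch of "parse quoted fields until a field ends exactly at a newline". It is written in Python purely for illustration and is not the project's parser: `read_row` and its parameters are hypothetical names, and it assumes `\n` line endings, `,` as the delimiter, and doubled quotes (`""`) as the escape for a literal quote.

```python
def read_row(text, pos=0, delim=",", quote='"'):
    """Read one row starting at `pos`; return (fields, next_pos).

    Hypothetical helper: a row ends where a field ends at a newline
    that is not inside quotes, or at the end of the input.
    """
    fields = []
    n = len(text)
    while True:
        field = []
        if pos < n and text[pos] == quote:
            pos += 1  # skip the opening quote
            while pos < n:
                ch = text[pos]
                if ch == quote:
                    if pos + 1 < n and text[pos + 1] == quote:
                        field.append(quote)  # doubled quote is a literal quote
                        pos += 2
                        continue
                    pos += 1  # closing quote
                    break
                field.append(ch)  # newlines inside quotes are ordinary data
                pos += 1
        # Unquoted field, or any stray text after a closing quote.
        while pos < n and text[pos] not in (delim, "\n"):
            field.append(text[pos])
            pos += 1
        fields.append("".join(field))
        if pos >= n:
            return fields, pos       # end of input ends the row
        if text[pos] == "\n":
            return fields, pos + 1   # field ended exactly at a newline
        pos += 1                     # skip the delimiter, read the next field


text = 'a,"x\ny",3\nb,z,4\n'
first_row, next_pos = read_row(text)
second_row, end_pos = read_row(text, next_pos)
```

With this shape, the row boundary falls out of field parsing itself, so no separate pass is needed to locate the end of a row before splitting it.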
Yeah, probably. I started writing
That sounds very plausible. I'm not sure when I'll have time again to get to that, though... Any chance you might take a stab at this?
@shashi what should we do about this? Could we maybe merge it for now? It seems to fix a pretty clear bug. I agree it is not the most elegant way to do this, and it would be nicer to use the existing machinery for that, but from my point of view it would be preferable to fix this bug right now, and then we can see if someone has time later to make the code more elegant?
Sounds good! :-)
Fixes #97.
The problem with the old version is that the whole guess phase breaks down if there are line feeds inside quotes in the rows that are used to guess the column types.
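The failure mode described above can be reproduced in a few lines. The sketch below uses Python and its standard `csv` module only as a stand-in; it does not show this package's code, and the sample data is made up. It compares a naive split on line feeds with a quote-aware reader when collecting the rows used for guessing.

```python
import csv
import io

# Made-up sample: the second record has a line feed inside a quoted field.
sample = 'id,comment\n1,"first line\nsecond line"\n2,plain\n'

# Naive approach: treat every line feed as a row boundary.
naive_rows = sample.strip("\n").split("\n")
naive_field_counts = [len(row.split(",")) for row in naive_rows]

# Quote-aware approach: line feeds inside quotes stay within the field.
quote_aware_rows = list(csv.reader(io.StringIO(sample, newline="")))
quote_aware_field_counts = [len(row) for row in quote_aware_rows]
```

The naive split produces four pieces with inconsistent field counts, so any type guessing based on them sees rows that do not exist in the data. The quote-aware reader returns three rows of two fields each.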
I want to trigger code coverage first to get a sense of what extra tests I need to add.
@shashi Alright, now ready to be reviewed.