Outlook for mythtv-0.22

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Looks bad...

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

Here's a version that will give you some detail as to the warnings:

http://digitalaudiorock.com/scripts/mythtv_0_22_corruption_test_det.pl

The recorded table is obviously important, as that represents your current recordings. oldrecorded is important for scheduling to know what's been recorded. Those seem to be the least of your errors, which is good.

As I mentioned previously, I think that clearing the oldprogram will affect nothing more than the ability to to a "New Titles" search. I also believe that people table can be cleared of anything not referenced in credits or recordedcredits.

DISCLAIMER: I can't as yet guarantee any of this!

Here's what I did from a mysql prompt to reduce my errors dramatically:

depontius · Advocate Joined: 05 May 2004 Posts: 3509

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Thanks, got it. I've looked through the results, and on their own there's not a lot to go by. When I have more time I'll grab a dump, and see how I can cross-reference the two together.
_________________
.sigs waste space and bandwidth

depontius · Advocate Joined: 05 May 2004 Posts: 3509

OK, I've chased down the two entries in "recorded".

"Eureka" subtitle "Noche de Suenos" - with an accent on that last "n" that caused the problem.
"The Hour Holiday Special" with Michael Buble' in the description, and that accented "e".

I'm not sure what to do with this, other than delete those 2 shows, scrap the whole table, or try your script.

I notice that you focused on 4 tables, and in discussion of fixing "partial corruption" I believe that they focus on 4 tables, also. Same 4?

---------------- edit ----------------------------

After further thought, I need to restate this...

Does your script indicate that I have a "uniformly corrupted," (not good) "partially corrupted," (bad) or "unfixable by published means" (really bad) database?
In other words, is simply attempting to upgrade the database by running the mythtv-0.22 mythtv-setup likely to fail?
If I do the suggested backup/drop/restore, then upgrade is that also likely to fail?
If I follow the directions for a "partially corrupt" database, losing all of my configuration in the process, then upgrade is that too likely to fail?

How about if I nuvexport, then delete those 2 offending shows, then give up on everything except my existing "recorded" database?

Oddly enough, had I just lived with the "bad" my.cnf from day 1, I'd probably have a "uniformly corrupt" database now, and have a fairly easy time of the upgrade. Because I saw instructions to "fix" it, either in forums, news, or something, THEN somehow my.cnf got back to utf8, I'm in a royal mess.

---

On another tack... I found the accented characters your script pointed out as the problem. Does the flagging by your script mean that those strings are present in the data multiple places, sometimes in latin1 and sometimes in utf8? Is the difference visible if I look at the raw mysql dump? It's a big file, but I've grubbed through it before, and I keep thinking about a script to parse it out a bit, making it more readable.

One question though... Is mysql whitespace-insensitive, as long as the whitespace is in the right place? One of the first things I'm thinking of is "s/),(/),\n(/g" to split table rows onto separate lines. I'm also thinking of a formatted print, to make them look more like a text table or spreadsheet. Beyond that, maybe a pair of utilities to split the dump into file-per-table-in-a-directory and another to glom them back together, again. Certainly good for reading and learning, but could mysql read the result of a round-trip?
_________________
.sigs waste space and bandwidth

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

The problem itself as I understand it was caused when the client connection was utf8 and things such as accented characters were involved. In that situation MythTV ended up trying to insert utf8 data into tables with latin1 character columns. This creates corrupted string data that can not be properly converted to utf8 later on.

The "incorrect string value" warnings you get in the test on rows with bad data actually causes data to be truncated. The four tables they test before attempting the actual upgrade are: people, oldprogram, recorded, and oldrecorded. I believe that the reason they focus on these is that bad data in some of their columns can cause a situation where the conversion to utf8 causes multiple rows to have the same value where unique indexes or primary keys are involved, which would cause the upgrade of the table to fail. I think that another reason they focus on these may be that they are the tables specifically susceptible to this, as they get their data from sources such as schedules direct which can frequently involve such characters.

Here's an example. I'm doing this test in a database that is latin1 by default, but with a client connection of uft8:

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

Somehow I missed your questions above...I think you added them after my last reply. By the way...I was able to upgrade and everything's working fine.

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Thanks for the detailed response. To make a long story short, I think I'm going to...

1 - Mask 0.22 until I'm good and ready, and that means getting Eureka and a few other shows transcoded and OUT of mythtv.
2 - Back up my database. (both as a mysqldump and a "cp -a" of the whole blinking mysql directory)
3 - Back up the backup. (mysqldump only, let's not get too ridiculous)
4 - Make sure I've got the source for all of my myth packages and qt3 saved away.
5 - Do a qpkg of existing mythtv packages, just to save time.

Obviously all of this is to keep myself well covered.

6 - Do the drop/reload on the database, since it's seems obvious that it's not going upgrade as-is.
7 - Try the upgrade, and hope.
8 - If that fails, follow the instructions for the partial upgrade, and hope.
9 - If that fails, start over and be glad I did step #1 above.

Back on playing with reformatting the mysql dump, the string I was looking at was "),(" to insert the newline. So the false newlines would not be added to every close-paren, but to places where there was a comma between a close paren and an open paren, as 3 consecutive characters. I'll admit the possibility of a false positive, but I suspect they'd be pretty rare.

Somewhere in this line, I just thought of the idea of converting a mysql dump into python code (with pysql bindings) that would recreate that database. At that point, I'm not sure what the whole point of the exercise is any more unless it would be possible to somehow make the record numbers/pointers symbolic in some way, so it would be possible to say, move a line in the "people" table and not have the whole thing come unglued. I suspect I'd be better off looking into one of the php/mysql interfaces (Isn't that really what mythweb is?) or the ability for OpenOffice to interface to mysql.
_________________
.sigs waste space and bandwidth

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

depontius · Advocate Joined: 05 May 2004 Posts: 3509

I'm not saying to hold off for long, just long enough to do a measured job.

I'd rather not see another "expat" mess.

Besides, it's for people who already have mythtv installed, in which case they've already got qt3 installed. Clearly neither are getting upgrades, but both remain functional for a transition. Outside of this thread, the bugzilla thread, and general mythtv rumble, there isn't a lot of warning. I don't know how many simply use mythtv like any other piece of software - as an appliance, and how many keep track of developments. Someone not cued in might not be aware, might let the upgrade just happen without being fully backed up, and lose data. (Or at least indexes to data, leaving them with a bunch of numbered files.)
_________________
.sigs waste space and bandwidth

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Next question...

What about "nuvexport"? There is one version in portage, and it is hard-tied to mythtv-0.21*. Looking at the mythtv pages, nuvexport still exists, as part of what they call "mythextras", though there is no package of that name in portage.
_________________
.sigs waste space and bandwidth

depontius · Advocate Joined: 05 May 2004 Posts: 3509

One more question...

I was planning to set aside time today to do the MythTV upgrade, and started reading through the "Fixing Corrupt Database Encoding" guide again, and got to here:

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

depontius · Advocate Joined: 05 May 2004 Posts: 3509

But I've looked at the "fixing the database", and it all seems to be about changing "SET NAMES utf8" to "SET NAMES latin1", which occurs once in the dump of mythconverg. I took a fresh dump this morning, did a grep for "SET NAMES", and it already says "SET NAMES latin1". I remember doing something like this a year or so back, fixing my.cnf, dropping and reloading the database, though I don't remember editing anything. It seems to me that

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Toast. (My setup, that is - failed upgrade, flagged as "partial corruption".)

Sometime a year or two back, there was some sort of notice about diddling with the mythtv database. I followed all of the directions. I know there was something in there about setting my.cnf to latin1, and something about dropping and reloading mythconverg. I don't remember too much more, only that I followed directions carefully.

When getting prepped for 0.22, I noticed that my.cnf once again had utf8. I don't know how that happened, whether the directory wasn't properly protected from automatic updates, or whether I missed one when doing etc-update. THAT is what I blame for the current problem - I had "corrected" the database long ago, following directions, and some time later my.cnf got switched back. I also connected my daughter's Ubuntu machine as a client, but I'm under the impression that only Gentoo keeps the default utf8 - everyone else switches it to latin1. At any rate, I can't easily inspect her machine, at the moment.

I tried your "fixit" script and it failed:

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Latest status... still toasty

I did a "partial restore", which should have left my previously recorded programs available - but it didn't. I've effectively started completely from scratch. It's fetching SchedulesDirect right now, and I'm going to exercise things a little. I'm alsogoing to take a look at the "partial restore" database done right before the conversion, and see what the heck was in there. Then I'm going to dump the post-conversion database and see what is in that.

All in all, I think I'm headed back to 0.21 shortly. I've got 0.22 working, and I'll probably qpkg it, so I can get back quickly. I'll also probably both dump and "cp -a" save the database.

But I'd really like to experiment more with "fixing" my database. Maybe it's time to get the python/mysql bindings and start learning. Still, I had things pretty well emptied before even trying this - I'd just like to try a little more at preserving my old stuff.

--- edit ---

Just checked my dumps. When you run mythtv-setup it makes a backup of your old database. This was the result of my "partial restore". I pulled it over to /tmp, gunzipped it, and started looking. At the very least, there's content in there. Then I did a dump of the new 0.22 database, and it's empty of old content - as if I'd completely started over. No "oldrecorded", no "recorded", a smaller "people" that's likely the result of the first pull from SchedulesDirect.

Time to qpkg, save the database, and head back to 0.21 and python/mysql info. I also want to head to the list, to understand why it threw out my whole database after the partial restore. At this point, I can probably set things up to swap back and forth fairly quickly.
_________________
.sigs waste space and bandwidth

tld · Veteran Joined: 09 Dec 2003 Posts: 1816

depontius · Advocate Joined: 05 May 2004 Posts: 3509

Fixed.

I just went back and re-ran the upgrade, and re-did the partial database restore. I believe I found the problem with the last time I tried, though I don't understand exactly why what I did produced the results that it did.

The partial restore instructions have you drop the database, recreate the database, restore the blank 1214 snapshot, and restore your own backup. The problem was that those instructions didn't have the "--partial_restore" flag on the command that restored my old backup. I can easily see that being wrong, though I don't understand why it created a full-sized database. On my first upgrade attempt, the pre-upgrade database backup was approximately "full-sized". On this upgrade attempt, the pre-upgrade database was approximately "half-sized" and my first post-upgrade database backup is approximately "2/3-sized".

I'm done. I'm not going to tempt fate any longer. Time to move the other machines up to 0.22, then to go back and fudge up some sort of nuvexport installation, since there isn't any nuvexport in portage to go with 0.22.

I do need to go back on the mythtv list to explain what happened. I also need to go back on bugzilla and make sure that the partial-restore instructions are properly corrected.
_________________
.sigs waste space and bandwidth

yngwin · Posted: Mon Feb 22, 2010 5:41 am Post subject:

Please add your findings to this topic: https://forums.gentoo.org/viewtopic-t-816566-highlight-.html
_________________
"Those who deny freedom to others deserve it not for themselves." - Abraham Lincoln
Free Culture | Defective by Design | EFF

dnm · Posted: Fri Feb 26, 2010 12:56 am Post subject: my machine does more than mythtv

Wow, I am definitively holding off upgrading mythtv to 0.22 for as long as possible. I am a longtime user and I have partial corruption (sigh). I also use other databases than mythconverg, so I am also thinking that changing a "misconfigured server" /etc/mysql/my.conf (as explained by the Fixing Corrupt Database Encoding) might not be a good idea. Also please wait with removing qt3 and mythtv 0.21 until there is a clear guide, reading up on it at this moment is really making my head hurt. Every case/how-to is just ignoring the fact that there might be more than mythtv's database. And what little text there is about that, is so vague, that it does not enlighten.

depontius · Advocate Joined: 05 May 2004 Posts: 3509

I wouldn't count on them holding off on qt-3 very long. The big push for mythtv-0.22 is really the push to get qt-3 out of portage.

Just back up your qt-3 - back up the source file, back up the ebuild - same with mythtv-0.21, if your so inclined. Make your own portage overlay (look for PORTDIR_OVERLAY) and put the mythtv-0.21 and qt-3 in it, in the appropriate categories, make sure you do run "ebuild ... digest" against them. Either make sure you have the source files, or run "emerge -f =x11-libs/qt-3### =media-tv/mythtv-0.21###" to get the source files - then mark them readonly, if not "chatter +i" so they don't get accidentally erased.

At this point, you should be able to rebuild, if anything goes wrong - from your own private overlay.

I was really worried about the upgrade - and it did take me 2 tries. Look earlier on this thread for my experiences and "partial corruption upgrade guide". It's also worth mentioning that once you're at mythtv-0.22, the character set configuration of the database won't matter any more. So actually right now you're more brittle. Once you're able to get to mythtv-0.22, you'll be better off.

I will also mention that I did my backups, upgraded, failed to upgrade the database, and went back to mythtv-0.21. On my second try I succeeded, and the key was adding the "--partial_restore" flag to the published instructions when I loaded my database over the "blank" database.

For your multiple database situation, I'd quiesce everything else while you're doing the mythtv upgrade. As I said, once you're done with the upgrade, the default character set won't matter any more, so you can put it all back to where you started. Then restart your other applications. While I had mysqldumps, my primary backup method was to stop all db applications, stop mysql, and use "cp -a" against "/var/lib/mysql". I just copied/moved everything at once.
_________________
.sigs waste space and bandwidth

yngwin · Posted: Fri Feb 26, 2010 12:14 pm Post subject:

It looks like the news item will be posted on Monday, which will mean the stabilization can go ahead immediately. Qt3 will be masked along with everything that depends on it the same day. Users who want to hang on to it can use the kde-sunset overlay.
_________________
"Those who deny freedom to others deserve it not for themselves." - Abraham Lincoln
Free Culture | Defective by Design | EFF