User talk:Reedy/Archive 2012

From WikiProjectMed
Jump to navigation Jump to search

Requesting help with Wikipedia:WikiProject TypoScan

Hi Reedy! It would be great to get Wikipedia:WikiProject TypoScan back on track. Is this something you would be willing to do? If not, is there some way the source code could be posted or handed off in a bot request so others could pick up where you left off? Thanks! GoingBatty (talk) 22:58, 30 December 2011 (UTC)

The biggest issue here is someone downloading, extracting and running the DB Scanner over the whole database dump. Not that difficult, just will take a long time.. I now have however, a hex core desktop machine with 24GB ram... so it should eat through it... Reedy (talk) 23:10, 1 January 2012 (UTC)
It would be wonderful if you could do that. Thanks! GoingBatty (talk) 00:05, 2 January 2012 (UTC)

The Signpost: 02 January 2012


Thanks

Thanks for fixing my settings. Your IT knowledge is impressive. Im planning to take an IT course myself so i wont be bewildered everytime i see computer programming. Pass a Method talk 17:18, 4 January 2012 (UTC)

Request for Interview Regarding Wikipedia Bots

Greetings Reedy-

My name is Randall Livingstone, and I am a graduate student at the University of Oregon, currently collecting data for my dissertation on Wikipedia editors who create and use bots and assisted editing tools, as well as editors involved in the initial and/or ongoing creation of bot policies on Wikipedia. As a member of BAG and the bot community, I would very much like to interview you for the project at a time and in a method that is most convenient for you (Gchat, another IM client, Skype, email, telephone, etc.). I am completely flexible and can work with your schedule. The interview will take approximately 30-45 minutes.

My dissertation project has been approved both by the Institutional Review Board (IRB) at the University of Oregon, and by the Research Committee at the Wikimedia Foundation. You can find more information on the project on my meta page.

Please let me know if you have any questions, and I look forward to hearing from you to set up a time to chat. Thank you very much.

Randall Livingstone, School of Journalism & Communication, University of Oregon

UOJComm (talk) 00:51, 6 January 2012 (UTC)

reply

You have new message/s Hello. You have a new message at GoingBatty's talk page. GoingBatty (talk) 00:51, 7 January 2012 (UTC)

The Signpost: 09 January 2012

MediaWiki code change

Hi Reedy. Are you the same reedy who is named at rev:108622? If so, could you please take a look at WP:VPT#Missing section edit links on a diff and advise? Thanks. --Redrose64 (talk) 22:06, 11 January 2012 (UTC)

I am, but I was merely doing the merge. It's not an area of code I know, so it'll be best to wait for input back on the bug. Reedy (talk) 23:46, 12 January 2012 (UTC)

The Signpost: 16 January 2012


The Signpost: 23 January 2012

Typoscan update?

Hi Reedy - any luck with the Typoscan update? Thanks! GoingBatty (talk) 03:12, 27 January 2012 (UTC)

Over 125,000 articles, still going.. Not too long left though! Reedy (talk) 12:46, 27 January 2012 (UTC)
144382. I'm wondering if I should no pre-parse the list against the live Wikipedia to filter out non typoed articles now. Now this is something that I've done yet.. I don't know whether we can (in theory, it should be fine). I'll see about taking a backup of the database as it is now, and then get it up on toolserver. Importing should be a quick activity... Reedy (talk) 16:23, 27 January 2012 (UTC)
Thanks for the update! GoingBatty (talk) 18:39, 27 January 2012 (UTC)

The Signpost: 30 January 2012

The Signpost: 06 February 2012

The Signpost: 13 February 2012

The Signpost: 20 February 2012

MSU Interview

Dear Reedy,

My name is Jonathan Obar user:Jaobar, I'm a professor in the College of Communication Arts and Sciences at Michigan State University and a Teaching Fellow with the Wikimedia Foundation's Education Program. This semester I've been running a little experiment at MSU, a class where we teach students about becoming Wikipedia administrators. Not a lot is known about your community, and our students (who are fascinated by wiki-culture by the way!) want to learn how you do what you do, and why you do it. A while back I proposed this idea (the class) to the communityHERE, where it was met mainly with positive feedback. Anyhow, I'd like my students to speak with a few administrators to get a sense of admin experiences, training, motivations, likes, dislikes, etc. We were wondering if you'd be interested in speaking with one of our students.


So a few things about the interviews:

  • Interviews will last between 15 and 30 minutes.
  • Interviews can be conducted over skype (preferred), IRC or email. (You choose the form of communication based upon your comfort level, time, etc.)
  • All interviews will be completely anonymous, meaning that you (real name and/or pseudonym) will never be identified in any of our materials, unless you give the interviewer permission to do so.
  • All interviews will be completely voluntary. You are under no obligation to say yes to an interview, and can say no and stop or leave the interview at any time.
  • The entire interview process is being overseen by MSU's institutional review board (ethics review). This means that all questions have been approved by the university and all students have been trained how to conduct interviews ethically and properly.


Bottom line is that we really need your help, and would really appreciate the opportunity to speak with you. If interested, please send me an email at obar@msu.edu (to maintain anonymity) and I will add your name to my offline contact list. If you feel comfortable doing so, you can post your nameHERE instead.

If you have questions or concerns at any time, feel free to email me at obar@msu.edu. I will be more than happy to speak with you.

Thanks in advance for your help. We have a lot to learn from you.

Sincerely,

Jonathan Obar --Jaobar (talk) — Preceding unsigned comment added by 35.9.34.167 (talk) 20:51, 22 February 2012 (UTC)

Bugzilla

Hi Reedy, What was the result of this bug? --M.Gedawy Talk 23:10, 23 February 2012 (UTC)

The Signpost: 27 February 2012

The Signpost: 05 March 2012

A barnstar for you!

The Technical Barnstar
For your AWB, Toolserver, TypoScan and RegEx work. Ankit MaityTalkContribs 09:20, 7 March 2012 (UTC)

The Signpost: 12 March 2012

The Signpost: 19 March 2012

The Signpost: 26 March 2012

The Signpost: 02 April 2012

Talkback

Hello, Reedy. You have new messages at Codrinb's talk page.
Message added 15:40, 7 April 2012 (UTC). You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.

Codrin.B (talk) 15:40, 7 April 2012 (UTC)

The Signpost: 09 April 2012

TypoScan questions

Hi Reedy! Could you please make sure that this edit I've made to Wikipedia:WikiProject TypoScan is accurate? Thanks! GoingBatty (talk) 03:40, 14 April 2012 (UTC)

Also, any idea why I'm not showing up on the list of users at tools:~awb/typoscan/? Thanks! GoingBatty (talk) 16:52, 14 April 2012 (UTC)
Two more questions about the stats, and I'll stop for tonight:
  • Should the "Number of Currently Checked Out Articles" in the Overview match the "Number of Currently Checked-Out Articles" in the Site states? (It doesn't match now).
  • Should the "Number of Never Checked Out Articles" in the Overview be "Number of Articles" less "Number of Ever Checked Out Articles"? (Right now it's 0).
Thanks! GoingBatty (talk) 03:58, 15 April 2012 (UTC)

Thanks for getting TypoScan working again. Regards, SunCreator (talk) 14:22, 15 April 2012 (UTC)

The Signpost: 16 April 2012

The Signpost: 23 April 2012

The Signpost: 30 April 2012

Typoscan

Hi Reedy! I was hoping to find out about the next Typoscan dump or ideas, I'm currently clearing the backlog out and finding a lot of typos rather then very few, several pages keep reoccurring even though I have already skipped them several times. I was wondering how does it issue pages out, is there overlapping or any issue as the scan approaches 100%? I'm in this project for the long haul! Not sure how much longer the backlog will remain with me around and I just wanted to make sure that Typoscan can check what I assume is a grossly inflated set of skips and ignores rather than false positives. ChrisGualtieri (talk) 22:45, 30 April 2012 (UTC)

There's quite a lot of variance between the dump and live. The dump could be upto a couple of months old, so factor in to there, people may have fixed them/removed the words, the typo rules may have been changed. There will be some cases where people have scanned and dealt with the page, but not submitted it. This gives quite a lot of variance... AWB itself uses pretty much the same process for processing the live articles, as from the database dumps. Have you been keeping an eye on the statistics (tools:~awb/typoscan/? Has the finished count etc gone up? Reedy (talk) 23:16, 30 April 2012 (UTC)
I have a printed record from April 25th, no new articles were added. The finished article count goes up as well as the ignore... exactly in line with my contributions to the project. Looks like no other user has been active in the project since then either. I'm sitting at 11,013 edited articles and 4,649 skipped. So simply... it hasn't updated yet. ChrisGualtieri (talk) 02:30, 1 May 2012 (UTC)
I just did a batch of 100 articles (my first since April 25), and see that several of the stats values have changed. Two issues though:
  1. The "Number of Currently Checked Out Articles" in the overview is less than the "Number of Currently Checked-Out Articles" in the Site Stats
  2. I'm still not listed in the User Stats.
Your continued assistance is always appreciated. Thanks! GoingBatty (talk) 03:13, 1 May 2012 (UTC)
As I am still actively working it might be that some of the currently checked out articles correspond to me, as I just refreshed another 500 block. If you only did 100 then the other 3500 were mine. I do not do 100 blocks, but I cannot see your contribution to the project. That is odd. ChrisGualtieri (talk) 03:42, 1 May 2012 (UTC)
Might be worth taking a stats snapshot as is, and empty all the tables for the next dump import. There are likely some discrepancies there from the original development. I'll pull a copy of the database off later and have a poke around and see what's what; it's the easiest way to see the state of the article table etc. There is possibly some crappy SQL queries that might be cause of some of the small discrepancies. Reedy (talk) 10:10, 1 May 2012 (UTC)

I see "ChrisGualtieri" in the database, but not GoingBatty... Reedy (talk) 15:25, 1 May 2012 (UTC)

User Number of Saved Articles Number of Skipped Articles Total Number of Articles
Reedy 84 1,521 1,605
Reedy 109 1,530 1,637

Before and after stats above - 75 edits, 25 skipped, but they're wrong.

mysql> select * from articles where userid = 2 ORDER BY articleid desc LIMIT 100;
+-----------+---------------------------------------------------+---------------------+----------+--------+--------+---------------------+--------+
| articleid | title                                             | checkedout          | finished | skipid | userid | checkedin           | siteid |
+-----------+---------------------------------------------------+---------------------+----------+--------+--------+---------------------+--------+
|    250360 | Rescue craft                                      | 2012-05-01 15:32:42 |        1 |      0 |      2 | 2012-05-01 16:05:28 |      1 |
|    250359 | Tick (check mark)                                 | 2012-05-01 15:32:42 |        1 |      0 |      2 | 2012-05-01 16:05:28 |      1 |
|    250358 | Lida                                              | 2012-05-01 15:32:42 |        1 |      0 |      2 | 2012-05-01 16:05:28 |      1 |
|    250357 | Fausto Elhuyar                                    | 2012-05-01 15:32:42 |        1 |      0 |      2 | 2012-05-01 16:05:28 |      1 |
|    250356 | Phi Delta Theta                                   | 2012-05-01 15:32:42 |        0 |      4 |      2 | 2012-05-01 15:49:48 |      1 |
|    250331 | Al-Hajj                                           | 2012-05-01 15:32:42 |        0 |  69285 |      2 | 2012-05-01 16:04:28 |      1 |
|    250330 | Band of the Scots Guards                          | 2012-05-01 15:32:42 |        1 |      4 |      2 | 2012-05-01 16:04:26 |      1 |
|    250329 | List of defunct Canadian railways                 | 2012-05-01 15:32:42 |        1 |  69285 |      2 | 2012-05-01 16:04:26 |      1 |
|    250300 | Manès Sperber                                    | 2012-05-01 15:32:42 |        0 |  69285 |      2 | 2012-05-01 16:04:28 |      1 |
|    237950 | Kirstie Alley                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:21 |      1 |
|    237949 | Sindhi language                                   | 2012-04-06 22:07:56 |        0 | 376665 |      2 | 2012-04-06 23:37:21 |      1 |
|    237941 | New Monarchs                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:21 |      1 |
|    237923 | Highways in Greece                                | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:21 |      1 |
|    237919 | Ashton-Tate                                       | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:21 |      1 |
|    237918 | Dholak                                            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:20 |      1 |
|    237917 | Djembe                                            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:20 |      1 |
|    237916 | King Ghidorah                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:20 |      1 |
|    237911 | Malinalco, State of Mexico                        | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:20 |      1 |
|    237899 | Rational ignorance                                | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:20 |      1 |
|    237895 | Islam in France                                   | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:20 |      1 |
|    237885 | In God We Trust                                   | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:19 |      1 |
|    237872 | Structural functionalism                          | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:19 |      1 |
|    237866 | Dwarf tossing                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:19 |      1 |
|    237853 | Aromanians                                        | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:19 |      1 |
|    237838 | East Boston                                       | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:19 |      1 |
|    237834 | Schwingen                                         | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:19 |      1 |
|    237824 | Las Vegas (TV series)                             | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:18 |      1 |
|    237807 | Canton of Neuchâtel                              | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:37:18 |      1 |
|    237797 | Giants Stadium                                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:48 |      1 |
|    237795 | Carlota Alfaro                                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:47 |      1 |
|    237780 | Élysée Palace                                   | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:47 |      1 |
|    237778 | Chartreuse Mountains                              | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:47 |      1 |
|    237768 | Vaira Vīķe-Freiberga                            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237761 | List of television reporters                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237745 | Mandurah                                          | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237740 | Bill Shankly                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237735 | Pedro González de Mendoza                        | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237730 | Khwarezm                                          | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237720 | Francisco Jiménez de Cisneros                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:46 |      1 |
|    237700 | Common Rosefinch                                  | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:45 |      1 |
|    237695 | Folsom State Prison                               | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:45 |      1 |
|    237691 | Network society                                   | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:45 |      1 |
|    237690 | Feroze Gandhi                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:45 |      1 |
|    237684 | Nihon Shoki                                       | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:44 |      1 |
|    237683 | METRORail                                         | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:44 |      1 |
|    237675 | Ranchi                                            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:44 |      1 |
|    237674 | Tree sitting                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:44 |      1 |
|    237673 | Jamshedpur                                        | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:44 |      1 |
|    237671 | Constitution of India                             | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:44 |      1 |
|    237667 | Indian Institute of Technology Roorkee            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:43 |      1 |
|    237663 | Never Mind the Buzzcocks                          | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:32:43 |      1 |
|    237659 | Gary Johnson                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:26 |      1 |
|    237653 | Ferdinand Tönnies                                | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:26 |      1 |
|    237646 | Paul Muldoon                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:26 |      1 |
|    237637 | Púca                                             | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:25 |      1 |
|    237632 | Herzliya                                          | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:25 |      1 |
|    237631 | Swiss Army knife                                  | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:25 |      1 |
|    237629 | Jaleco                                            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:25 |      1 |
|    237624 | Eastern Cape                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:25 |      1 |
|    237607 | Gravesend, Kent                                   | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:25 |      1 |
|    237605 | Cups and balls                                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:24 |      1 |
|    237592 | Huntingdon Life Sciences                          | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:24 |      1 |
|    237586 | The Good Son (film)                               | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:24 |      1 |
|    237580 | Stevenage                                         | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:23 |      1 |
|    237577 | Hirschsprung's disease                            | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:23 |      1 |
|    237574 | Aripiprazole                                      | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:23 |      1 |
|    237567 | Prout's hypothesis                                | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:23 |      1 |
|    237561 | Archie Bunker                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:23 |      1 |
|    237558 | Cangjie input method                              | 2012-04-06 22:07:56 |        0 | 376665 |      2 | 2012-04-06 23:30:23 |      1 |
|    237548 | Television in Greece                              | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:22 |      1 |
|    237541 | Feni District                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:22 |      1 |
|    237536 | Sonoran Desert                                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:21 |      1 |
|    237526 | Nicolae Paulescu                                  | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:21 |      1 |
|    237517 | Dependent personality disorder                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:30:21 |      1 |
|    237513 | Invisible hand                                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237498 | Mycroft Holmes                                    | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237496 | Music of Florida                                  | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237495 | List of best-selling singles in the United States | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237479 | C. Odumegwu Ojukwu                                | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237467 | Jana Novotná                                     | 2012-04-06 22:07:56 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237463 | Glossary of ecology                               | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:56 |      1 |
|    237460 | Lassi                                             | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:55 |      1 |
|    237454 | Rob Van Dam                                       | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:55 |      1 |
|    237432 | ASRAAM                                            | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:55 |      1 |
|    237430 | Bowen Island                                      | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:55 |      1 |
|    237428 | Legume                                            | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:55 |      1 |
|    237426 | QR decomposition                                  | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:55 |      1 |
|    237419 | Feral                                             | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:54 |      1 |
|    237417 | 2003 European heat wave                           | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:54 |      1 |
|    237410 | Black Sox Scandal                                 | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:54 |      1 |
|    237402 | Eric Bischoff                                     | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:54 |      1 |
|    237399 | Comcast                                           | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:54 |      1 |
|    237390 | Conquistador                                      | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:54 |      1 |
|    237384 | Nanny                                             | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:53 |      1 |
|    237374 | Heart of Midlothian F.C.                          | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:53 |      1 |
|    237368 | Sister Souljah moment                             | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:53 |      1 |
|    237364 | Rochford                                          | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:27:53 |      1 |
|    237334 | Testicular cancer                                 | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:25:15 |      1 |
|    237325 | Sinbad the Sailor                                 | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:25:15 |      1 |
|    237314 | Heinrich Lübke                                   | 2012-04-06 22:07:55 |        0 |      4 |      2 | 2012-04-06 23:25:15 |      1 |
+-----------+---------------------------------------------------+---------------------+----------+--------+--------+---------------------+--------+
100 rows in set (0.04 sec)

I'm guessing that submission isn't working correctly, or is giving false positives. I'll have to attempt to debug it further... Reedy (talk) 16:11, 1 May 2012 (UTC) Reedy (talk) 16:11, 1 May 2012 (UTC)

Perhaps my own information is off as well? I do so many blocks that I wouldn't notice if it wasn't exact. I've had reporting errors before, sometimes a backlog of 100-300 articles on an instance of AWB which never resolves. Though I still believe it registered... but I have no evidence of whether or not it counted. All I know is I keep seeing 'List of Atlas Shrugged Characters' or something to that effect every time I load Typoscan. ChrisGualtieri (talk) 16:20, 1 May 2012 (UTC)

New AWB release time

We've not done an AWB release for a good few months, I think it's about time for one. What do you think? Thanks Rjwilmsi 19:11, 1 May 2012 (UTC)

The Signpost: 07 May 2012

Typoscan

Could you upload the new TypoScan dump. I want to begin with a clean false positive page and create a simple flow through for cross-checking the false positives and cleaning out the backlogs. When the list is complete I'll upload the false positive list for verification and help make sure all the Typos are removed from Wikipedia. It will also help refine the process of catching and correcting Typos. Patterns in false positives could then be addressed for future runs. Its kinda complicated, but I think this might be an improvement to the process. ChrisGualtieri (talk) 17:32, 14 May 2012 (UTC)

It requires a DB dump to be processed first... Reedy (talk) 18:14, 14 May 2012 (UTC)
Oh I thought it was already done and ready to go for some reason. Sorry. I thought it was processing last time we talked. ChrisGualtieri (talk) 21:27, 14 May 2012 (UTC)

The Signpost: 14 May 2012

TypoScan again

Can never seem to keep this on the talk page because of the archiving system. I've begun working on the TypoScan project again. I suppose the next time I contact you will be when its done, or just before it to request additional actions. Its only a 100k. Shouldn't take me that long.ChrisGualtieri (talk) 03:13, 22 May 2012 (UTC)

The Signpost: 21 May 2012

The Signpost: 28 May 2012

The Signpost: 04 June 2012

The Signpost: 11 June 2012

The Signpost: 18 June 2012

The Signpost: 25 June 2012

The Signpost: 02 July 2012

Request for new AWB SVN

Hi Reedy! The last AWB SVN was posted on April 15. There have been 26 changes made since then. Could you please post a new SVN when you have a chance? Thanks! GoingBatty (talk) 02:44, 6 July 2012 (UTC)

Hello? GoingBatty (talk) 01:09, 10 July 2012 (UTC)
Toolserver sucks. Reedy (talk) 12:42, 10 July 2012 (UTC)

The Signpost: 09 July 2012

The Signpost: 16 July 2012

AWB Logging to File

Magioladitis sent me to you for help, and I figured talking to you might be quicker than posting this on the AWB talk page. How exactly do I get the Logging to file feature to work? I have selected Upload, and I press apply, but the "Lines" column never goes above 2. And if I press "Turn on" it deselects the "Upload" checkbox. So am I doing something wrong? Inks.LWC (talk) 06:17, 21 July 2012 (UTC)

The Signpost: 23 July 2012

Typoscan

Almost finished with Typoscan's latest dump, could use another one in a few days. I might blitz the end and have them done by Friday night. ChrisGualtieri (talk) 04:31, 26 July 2012 (UTC)

The Signpost: 30 July 2012

The Signpost: 06 August 2012

The Signpost: 13 August 2012

How can I get a snapshot of rev 8216?

I am suffering from the bug "AWB takes an inordinate amount of time and CPU to display some pages" and am seriously looking forward to a snapshot that will fix it (8216 or higher). Does making a request somewhere do any good, or does that just annoy someone I don't want to annoy? Getting the source and compiling it sounds daunting. Even a good guess about what day such a snapshot can be expected would probably reduce my anxiety level somewhat. The fix for this bug will make life better for a lot of editors, including many who don't really know why they are experiencing so many slowdowns. Chris the speller yack 21:59, 14 August 2012 (UTC)

8267 is now up. Reedy (talk) 23:17, 14 August 2012 (UTC)
I got it, and am very grateful. Your kindness will not soon be forgotten. Chris the speller yack 01:24, 15 August 2012 (UTC)

The Signpost: 20 August 2012

Typoscan

Please check Typoscan, in desperate need of a new dump or upload. Please talkback on my page when you read this, I'm interested in continuing this project, but the workload is about to end. ChrisGualtieri (talk) 01:51, 8 August 2012 (UTC)

Under 20,000 left now. ChrisGualtieri (talk) 16:03, 9 August 2012 (UTC)
It still requires the download of a database dump [1] (8.5GB at last count), extraction to over XXGB, and then processed. I'd be somewhat interested to see how much faster this may occur on an SSD rather than a spindle hard drive, on my hex core desktop. AWB could benefit from being able to work with the database dump still compressed (an aside). Either way, it's still going to take a while.
There is also the decision of whether we just clear the current database and stats.
Reedy (talk) 16:52, 9 August 2012 (UTC)
Sounds like a lot of work, I'll probably finish up the remaining articles tomorrow. No rush, the new Regex rules will probably throw up a lot of additional articles this time around. I've got another project to do in the mean time, just like to keep the ball rolling on these things simply because the improvement to the readability of Wikipedia goes up dramatically when awkward typos are removed. Some editors had tagged the typos with clarify simply because they couldn't understand the context with the errors. As for stats, they matter little to me, but I'll grab my record after the pile is done with. Other editors might want to keep their records though, if possible. ChrisGualtieri (talk) 05:15, 10 August 2012 (UTC)
Am I the only one whose stats never showed up? GoingBatty (talk) 06:01, 10 August 2012 (UTC)
Withheld on Typoscan has 831 saves, I'd do 100 and see if that value changes, if so then your name is withheld. ChrisGualtieri (talk) 13:02, 10 August 2012 (UTC)
Will do with the next batch - thanks for the advice. If my name is withheld, how would I get it un-withheld? Thanks! GoingBatty (talk) 05:02, 15 August 2012 (UTC)
Maybe The setting in AWB Options - Preference - Privacy tab Wikipedia:AutoWikiBrowser/User_manual#Options. Regards, Sun Creator(talk) 07:24, 15 August 2012 (UTC)
That's a good guess, but I already have that box checked. GoingBatty (talk) 16:26, 15 August 2012 (UTC)
Under 5,000 untouched articles. I took out the rest of the articles and have 7,150 loaded. Not sure if that's redundancy or something, but its almost finished. ChrisGualtieri (talk) 23:43, 10 August 2012 (UTC)
Heads up, its at 100%+ now. Is that supposed to be like that? I'm still getting typos on the new page requests that I do. ChrisGualtieri (talk) 02:22, 11 August 2012 (UTC)
It won't let me pull new pages now. I shut down AWB and brought it back up... Nothing. Guess that's about it, but I did notice in those 100+ articles extra that I did, many errors were still present. Though I was skipping with a 3 to 1 ratio. ChrisGualtieri (talk) 02:28, 11 August 2012 (UTC)
I've downloaded and extracted a new snapshot onto my desktop. I'm rather un-impressed with how fast the database scanner is running. It only uses 17% of my total CPU capacity (6 core machine, "12" with hyper threading). Leaving my computer on with that amount of CPU usage is somewhat wasteful. It is multithreaded, but it's not seemingly that busy. And reading from an SSD, I'd be VERY suprised if it was I/O bound. Reedy (talk) 22:06, 11 August 2012 (UTC)
Priority limitations, CPU limiter? Maybe it doesn't run multithreads? I've had such programs that do so, but I do not know your set up or much about what you are doing, but seems TypoScan still will let people pull some files, not a large amount though. ChrisGualtieri (talk) 04:51, 12 August 2012 (UTC)
Nothing to do with TypoScan, I'm scanning a database dump. Also neither are in play, priority makes no difference. It is multi-threaded, has been for a few years. Spawns a process per the number of cpu cores -1. I can open Prime95 and it'll instantly max out all cores... Reedy (talk) 17:54, 12 August 2012 (UTC)
TypoScan won't let me grab new articles now, I'll just wait for the next upload. ChrisGualtieri (talk) 14:03, 13 August 2012 (UTC)
Will using the Pre-parse function throw off Typoscan reporting? I just discovered its existence. ChrisGualtieri (talk) 04:14, 15 August 2012 (UTC)
Any idea? (bump) ChrisGualtieri (talk) 17:08, 17 August 2012 (UTC)
No, it's fine. It'll mark any skipped articles as skipped. Not saved won't be affected. Reedy (talk) 23:44, 17 August 2012 (UTC)
Efficiency boost! Any ETA on the next update? I'm almost done with my next Wikiproject work. ChrisGualtieri (talk) 00:42, 18 August 2012 (UTC)
Bump to prevent bot from archiving, in case you've been away. ChrisGualtieri (talk) 14:32, 20 August 2012 (UTC)

Nomination of Levógiro for deletion

A discussion is taking place as to whether the article Levógiro is suitable for inclusion in Wikipedia according to Wikipedia's policies and guidelines or whether it should be deleted.

The article will be discussed at Wikipedia:Articles for deletion/Levógiro until a consensus is reached, and anyone is welcome to contribute to the discussion. The nomination will explain the policies and guidelines which are of concern. The discussion focuses on high-quality evidence and our policies and guidelines.

Users may edit the article during the discussion, including to improve the article to address concerns raised in the discussion. However, do not remove the article-for-deletion template from the top of the article. LuciferWildCat (talk) 06:08, 26 August 2012 (UTC)

The Signpost: 27 August 2012

Next TSN dump

Any ideas on a date for the next dump to go up? ChrisGualtieri (talk) 04:03, 27 August 2012 (UTC)

Ditto that question. Regards, Sun Creator(talk) 11:11, 29 August 2012 (UTC)
Actually, how do I build a list? I downloaded the database dump and tried to scan it with the Typo search option on AWB's Database Scanner, but it takes forever and the articles it flags do NOT have typos. I really want to continue with the project, but I cannot figure out what to do, if its doable I could probably try and provide a list. ChrisGualtieri (talk) 15:15, 29 August 2012 (UTC)
That's the reason I've given up for now (per my previous reply saying it wasn't using all cores correctly). There's no point doing it if my machine is going to sit mostly idle fopr days, it's wasting power and time.. It is going to take ages, purely by the size of the dump and the amount of articles/text. i.e. 37GB worth. The dump is dated 20120802, so anything that has been changed since 20120802 won't have typos. I can't remember exactly how it's done, but for the dumps (I think), an active database slave is used, and it just works through them one at a time, rather than just a snapshot. Some articles will hence be out of date (noting the dump is now 4 weeks old)... Reedy (talk) 12:41, 30 August 2012 (UTC)
I'm processing a list now. AWB's datascanner must not be doing it fast enough, because I got a data queue of 9000 minutes remaining. It is only using 2 cores, that's why. If I knew a work around I'd do that, but otherwise it'll take a literal week to get the results. ChrisGualtieri (talk) 13:17, 30 August 2012 (UTC)
It spawns the correct number of worker threads (one per core - 1), but there isn't any core affinity set to them, and .NET seems to do a crappy job of making all cores busy. Reedy (talk) 16:24, 30 August 2012 (UTC)

The Signpost: 03 September 2012

The Signpost: 10 September 2012

TypoScan status update

Heya old buddy! Long time, no wiki! Does TypoScan still work? What's the status of it? When I pull in TypoScan with the latest AWB the only result I get is "List of Atlas Shrugged characters." Any idea on that?

I'm going to be running RETF against the latest dump this week if you need it.

Regards, --mboverload@ 05:55, 16 September 2012 (UTC)

All the pages in the database are "fixed". AWB was being sucky when I tried running a scan over a newer DB dump, and was using little CPU time, meaning most of the 6 cores on my desktop were sitting pretty idle! I got annoyed and decided I wasn't leaving my computer on doing little work for multiple days :( Reedy (talk) 00:44, 17 September 2012 (UTC)
My own scan came up with largely useless ones through an regex with my estimated time to completion being measured in weeks of 24/7 running. Still interested. ChrisGualtieri (talk) 01:58, 17 September 2012 (UTC)

The Signpost: 17 September 2012

The Signpost: 24 September 2012

The Signpost: 01 October 2012

TypoScan

I'm downloading the latest dump. Once downloaded can you explain how I might go about updating WP:TYPOSCAN with the latest information? I'd like to know how you create the list of articles from the database dump and how that list can be re-uploaded into the TypoScan list. Regards, Sun Creator(talk) 18:34, 3 October 2012 (UTC)

The Signpost: 08 October 2012

TypoScan

I'm downloading the latest dump. Once downloaded can you explain how I might go about updating WP:TYPOSCAN with the latest information? I'd like to know how you create the list of articles from the database dump and how that list can be re-uploaded into the TypoScan list. Regards, Sun Creator(talk) 18:34, 3 October 2012 (UTC)

Bump

Solicitud

Hola Reedy, podrias ver esto. Gracias Deivismaster (talk) 00:01, 14 October 2012 (UTC)

The Signpost: 15 October 2012

Bugzilla for exceptions?

What do you think of creating a component in Bugzilla for AWB bugs, especially for exceptions? It seems some bugs are API related. We could also catch more attention from bug fixers for exceptions.

If yes, who do you think could be default assignee for reports against AWB? (Does not necessarily have to be a person, An account like "awb-bugs@example.xyz" could be set up.

Your opinion is strongly required. -- Magioladitis (talk) 19:57, 18 October 2012 (UTC)

The Signpost: 22 October 2012

Request for new SVN

Hi Reedy! Per Wikipedia talk:AutoWikiBrowser#Multiple issues, could you please post a new SVN snapshot for us? Thanks! GoingBatty (talk) 01:09, 18 October 2012 (UTC)

Yes, please. -- Magioladitis (talk) 10:00, 18 October 2012 (UTC)
Hi Reedy! I see SVN 8470 was posted. However, based on Wikipedia_talk:AutoWikiBrowser/Bugs#Can.27t_open_AWB this bug report, could you please post a new snapshot? Thanks again! GoingBatty (talk) 16:56, 19 October 2012 (UTC)
Pretty please? GoingBatty (talk) 00:46, 27 October 2012 (UTC)
8506 is up. -- Magioladitis (talk) 00:08, 29 October 2012 (UTC)

The Signpost: 29 October 2012

The Signpost: 05 November 2012

Hello, Reedy, and thank you for your contributions!

Some text in an article that you worked on Jake Bugg/FixCites, appears to be directly copied from another Wikipedia article, Jake Bugg. Please take a minute to double-check that you've properly attributed the source text in your edit summary.

It's entirely possible that this bot made a mistake, so please feel free to remove this notice and the tag it placed on Jake Bugg/FixCites at any time. MadmanBot (talk) 22:14, 12 November 2012 (UTC)

The Signpost: 12 November 2012

The Signpost: 19 November 2012

LaTeX

Dear Reedy,

Mathematicians and scientists and engineers and even economists will love you and erect shrines to you if you will let us use LaTeX packages on Wikipedia. :)

<3

Sue Gardner could improve her standing by making this a priority! :)

I have spent weeks wrestling with trying to convert LaTeX to PDF (trivial) as snapshots to PNG (not so fun), and finally to SVG (an utter failure). Always hogging memory for my pictures, PNG looks blurry on IE Explorer in Windows, and even on Ubuntu and Macs looks cheap.

I'll probably ditch my LaTeX code and redo my guitar codes in Lilypond, which allows exporting SVG files. Others just give up.

Thanks!

Best regards, Kiefer.Wolfowitz 23:41, 27 November 2012 (UTC)

Unfortunately, this might not be as simple as it initially sounds.. Score is due for deployment (it is currently halfway there, as it is on test2wiki at least), but has a couple of blocker issues before widescale deployment can happen.
What is wrong/insufficient/broken about the Math extension? Unfortunately, I'm not overly familiar with it, so can't advise as much as to what might need to happen.
If we can actually identify what is wrong/needs doing, we can get a bug logged in the open where people can actually see it... Reedy (talk) 11:52, 28 November 2012 (UTC)

The Signpost: 26 November 2012

The Signpost: 03 December 2012

The Signpost: 10 December 2012

The Signpost: 17 December 2012

Merry Christmas!

The Signpost: 24 December 2012